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ABSTRACT 

We study the bivariate size-luminosity distribution of Lyman Break Galaxies (LBGs) selected at 
redshifts around 4 and 5 in GOODS and the HUDF fields. We model the size-luminosity distribution 
as a combination of log-normal distribution (in size) and Schechter function (in luminosity) , therefore 
it enables a more detailed study of the selection effects. We perform extensive simulations to quantify 
the dropout-selection completenesses and measurement biases and uncertainties in two-dimensional 
size and magnitude bins, and transform the theoretical size-luminosity distribution to the expected 
distribution for the observed data. Using maximum- likelihood estimator (MLE), we find that the 



Schechter function parameters for _B435-dropouts are a = —1.68 



1.791q'52 X 10 Mpc ^. The log-normal size distribution is characterized by the peak i?o = 1-34 



-1-0.068 
-0.095' 



M* 



-20.60+o:i7, and 



kpc at Misoo 



-1-0.099 
-0.108 



ri 99-f0.058 
"•^^-0.056- 

1.55l:o;?7 X 10-= 



-21 mag, width crinR = 0.83Iq'q44, and the slope of the size-luminosity {RL) relation 

Similarly, for F606-dropouts we find a = -1.74^55;^^, M* = -20.53l°;24, 0* = 

Mpc-^ i?o = l-lStp.ie kpc, CTinR = 0.90iE];J^5, and (3 = The Schechter 

function parameters are consistent with the values in the literature, while the size distributions are 
wider than expected from the angular momentum distribution of the underlying dark matter halos. 
The slope of the size-luminosity {RL) relation is similar to those found for local disk galaxies, but 
considerably shallower than local early-type galaxies. 

Subject headings: galaxies: evolution — galaxies: high-redshift — galaxies: luminosity function — 
galaxies: structure — method: data analysis 



1. INTRODUCTION 

The study of the high-redshift galaxies has benefited 
tremendously from the launch of the Hubble Space Tele- 
scope (HST), especially from its exquisite sensitivity and 
angular resolution. In particular, the Advanced Cam- 
eras for Surveys (ACS) on board HST observes in the 
optical wavelength range, probing the rest-frame far-UV 
wavelengths of the emission from high-redshift galaxies 
that are sensitive to the instantaneous star-formation 
rate. Several deep surveys of field galaxies, such as 
the Great Observatorie s Origins Deep Survey (GOODS; 



iGiavalisco et al 



l2004aD and the Hubble Uhra Deep Field 



Survey (HUDF: lBeckwith et al.l l2006). took advantage of 
the powerful survey abilities of ACS to study the high- 
redshift universe. With multi-wavelength data taken by 
other instruments, the GOODS fields are among the most 
data-rich portions of the sky, looking back to the very 
early universe. 

The Lyman Break sele ction method, pioneered by 
iSteidel fc HamiltonI ()1992[ ). selects high-redshift galaxies 



using their rest-frame UV continuum emission. The Ly- 
man Break technique selects star-forming galaxies with 
redshift between 3 and 6 using near-UV and optical 
photometry (and higher-redshift galaxies using near-IR 
data). The Lyman Break Galaxies (LBGs) are in general 
actively star-forming, UV-bright galaxies that are pos- 
sibly the proge nitors of present-day massive spheroidal 
galaxies (e.g. iPettini et al.l 120011: IShaplev et all 120011: 
iPapovich et all I200H ). " Their rest-frame UV proper- 
ties shed light on t h e cosmic history of star formation 
(jMadau et al.l 119961: iGiavahsco et alj l2004bD . ahhough 
the attenuation of UV light by dust in these galaxies 
also plays an important role. LBGs are an important 
population of galaxies in the early universe, when the 
cosmic star-formation rate density was much higher than 
at 2; ^ 0. 

The luminosity function, i.e. the number-density dis- 
tribution as a function of galaxy luminosity, is one 
of the most important and the most extensively stud- 
ied statistical properties of the high-redshift galaxy 
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population. Specifically, at redshift around 4 and 
5, many studies of the UV luminosity func tion (usu- 
ally parameterized by the Schechter function; 'Schechter' 
1976) of th e LBGs already exist (Stcidcl ct al. 1999; 
Ouchi ct al 2004; Giavalisco 2005; Yoshida ct al. 2006,- 
Beckwith ct al. 2006; I wata ct al. 2007; Bouwcn s ct al.l 
2007rfOesch et al.ll2007tlvan der Burg et al.ll20Tot) . How- 
ever, differences in sample selection, photometry, and 
modeling procedures can lead to contr adictory results 
using the same datasets. For example, iBeckwith et al.l 
(pb06i) derived UV luminosity functions of LBGs at z ~ 
4, 5, and 6 selected from the HUDF and the GOODS 
fields. They concluded that the Schechter function pa- 
rameters a and M* do not evolve significantly between 
these redshifts, but the overall normalization constant (j)* 
evolves by a factor of ~ 3 in the re dshift range 4 < z < 6 . 
On the other hand, the analyses of'Bouw ens et al.l (|2007l ) 
show that the Schechter function parameters M* bright- 
ens by ^ 0.7 mag, but (/>* does not seem to evolve much. 
The discrepancies remind us of the complexities involved 
in deriving the luminosity functions. 

The size distribution (or equivalently the surface- 
brightness distribution) of LBGs is not modeled jointly 
with luminosity function in previous studies. A vari- 
ety of approaches have been adopted to account for the 
bias against selecting low-surface-brightness galaxies and 
biases in their photometry . Perhaps the most s ophisti- 
cated approach was that of lBouwens et al.) ()2007t ) , which 
uses a combination of "cloning" lower- redshift dropout 
galaxies to higher redshift, and photometrically degrad- 
ing galaxies observed in deep o bservations to simu late 
shallower observations (see also lOesch et al.l 120071 and 
their "dimmed galaxies" simulations). The appeal of 
this approach is that it uses real galaxies, with their 
sometimes clumpy and irregular morphologies. However, 
there are some drawbacks to the approach that suggest at 
least checking the results with a different approach alto- 
gether. Among the drawbacks are the following: (1) Only 
relatively small numbers of galaxies with high S/N are 
available for cloning to higher redshift. (2) At z > 1.2, 
angular diameter increases with redshift, which implies 
that the cloned images should be made bigger. Evo- 
lution toward smaller physical sizes at higher redshifts 
may be more than sufficient to counteract this effect. 
But even in that case, the formally correct procedure 
would involve deconvolving, rescaling, and re-convolving 
(or equivalently convolving with an appropriate kernel 
that accomplishes this). In practice, this is probably a 
minor effect and it has been generally ignored. (3) The 
true physical sizes of the cloned galaxies are not really 
known; all that is known is their measured sizes prior 
to cloning. (4) When a deep field is used to provide 
sources for assessing completeness and photometric bias 
in a wider field, this is of course attempting to represent 
a large volume as a bootstrap resampling of a small vol- 
ume. Rare objects are likely to be poorly represented in 
the small volume, and thus the completeness and photo- 
metric bias in those parts of parameter space are likely to 
be poorly characterized for the wide images. (5) These 
corrections make assumptions about the size distribution 
and its evolution. These become strong priors on the fit 
to the luminosity function parameters, leaving one some- 
what uneasy about the robustness of the results. 

An alternative approach, which we adopt here, is to 



model the luminosity function jointly with either the 
size or the surface-brightness distribution to include the 
surface-brightness-related selection effects. Ideally, one 
would also like to include other physical parameters that 
influence sample selection jointly such as color, Lya 
equivalent width and surface-brightness proflle. But the 
problem quickly becomes intractable as the number of 
dimension increases. In this paper we attempt to model 
the size distribution jointly with the luminosity func- 
tion in order to remove some of the biases in the UV 
luminosity function. An important motivation for fitting 
the bivariate distribution is to reveal the degeneracy be- 
tween the luminosity-function parameters and the size- 
distribution parameters. Because the observations select 
strongly against the low-surface-brightness galaxies, it is 
not obvious without detailed study that the LF param- 
eters are robust. We do not simultaneously fit for the 
distributions of color, Lya, or surface brightness profile 
here — in part because this raises the dimensionality of 
the problem enough that exploring parameter space be- 
comes difficult, and in part because the selection of func- 
tional forms for those distributions is less obvious. So 
instead of fitting those parameters, we follow the stan- 
dard practice of verifying that our model distributions 
qualitatively match the observed distributions, and focus 
on exploring the effect of including size in the luminosity 
function modeling. 

Galaxy size also provides important constraints on the 
galaxy-formation models. The canon ical pic ture of disk 
form ation introduced by iFall fc Efstathioul (Il980i) (see 
also IT983I: iDalcanton etH.! 119971: iMo et al.l Il998l) 
states that disks form from the gas within dark matter 
(DM) halos collapsing while approximately conserving 
its angular momentum; the angular momentum comes 
from the tidal interactions between dark matter halos 
around the trans-linear regime in the hierarchical struc- 
ture formation picture. The angular momentum of DM 
halos is usually parameterized by the spin parameter 
A = J|£;|1/2g-im-5/2 (|Peeblesl [19691) ■ and it follows 
log-normal distribution with peak A ~ 0.05 and cinA ~ 
0.5 in A^-body cosmological simula tions, relatively inde- 
pend e nt of halo mass or redshift (iBarnes fc Efstathioul 
[19871 : IWaTren et al.l[l99l IBuUock et al.ll200lD . Assum- 
ing that the baryons and dark matter receive the same 
amount of angular momentum per unit mass by tidal 
torque, one can relate the halo virial radius to the size 
of the disk: i?d oc Ai?vir- Therefore the size distri- 
bution should reflect the underlying A distribution, be- 
cause gas acquired most of its angular momentum with 
DM halos before collapse. On the other hand, reason- 
able galaxy sizes have been notoriously difficult to re- 
produce in the hydrodynamical simulations. Early in- 
vestigation s encountered the "angular-momentum prob- 
lem" (e.g. H avarro fc Benzl[T99l[ iSteinmetz fc Navarrj 
Il999f l that lead to unrealistically small galaxy sizes in 
the simulations. Later investigations implemented vari- 
ous forms of fe edback to alleviate the angular-mome ntum 
problem (e.g. iSales et al.ll2010l: iBrooks et al.ll20lH) . and 
improvements in numerical treatments of hydrodynam- 
ics have also shown promises in eliminating the angular 
momentum problem without resorting to late feedback 
(e.g. Kcrcs ct al. 2011; Torrcy ct al. 2011). 

The rest-frame UV size distribution of LBGs was also 
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studied by several authors ( independent of t he lu minos- 
ity function). For example. iFerguson et all ()2004[ ) stud- 
ied the mean half-light radii of star-forming galaxies (in- 
cluding LB Gs) in the GOOD S fields from z - 1 to 
~ 5. Also iHathi et aTl ([2008') studied the sizes of the 
spectroscopically-confirmed LBGs between z ~ 3 — 6. 
Both papers found that the evolution of the mean half- 
light radii roughly follows the H~^(z) relation, where in 
the concordance cosmology g^(z) = r^m(l + z )^ + ^a- 
iBouwens et"all (|2004D and lOesch et al.l \201( f) studied 
the mean size of LBGs from z ~ 2.5 to z ~ 8 and found 
that the evolution of the mean half-light radii of LBGs 
within the same luminosity range follow the (1 -I- z)~^ 
relation, roughly corresponding to H~'^/^{z) at z > 2.5. 
Building upon previous works, we investigate the size dis- 
tribution of LBGs by fitting the size distribution jointly 
with luminosity function and exploring the degeneracy 
between the two distributions, incorporating measure- 
ment biases calibrated to known half-light radii of para- 
metric galaxy profiles. The cloning technique can be used 
to create realistic galaxy profiles, but the uncertainties 
in the true sizes of the cloned galaxies make it difficult to 
calibrate our corrections. We also constrain the scatter 
around the mean size-luminosity relation, which is an 
additional constraint on the processes that govern star 
formation and feedback at early times. 

Because surface brightness affects both the detectabil- 
ity of high-z galaxies and the accuracy of the measure- 
ment of magnitude and size, we model the systematic 
biases in the measurements of both magnitude and size, 
which has not been done before on high-z LBGs. There 
were attempts to derive either the size-luminosity or the 
size-stella r mass distri butions of galaxies at z < 3 (e.g. 
Simard et al. 1999; dc J ong fc Lacevll2000l:llV^iillo et al.l 
2004 j2006i : Mosleh et al.ll20lH ). and after the first sub- 
mission of this paper, s imult aneou s and independent ef - 
forts by iMosleh et all ()2012D and iGrazian et all (|2012[ ) 
also investigated these distributions at z > 4, but not 
with the technique used here. 

Structure of the paper. — In this paper we study the 
bivariate size-luminosity distribution of LBGs selected 
at redshifts around 4 and 5. In Section [2] we describe 
our sample selection with optical color criteria. We also 
describe how we estimate the "photometric interloper" 
fractions — fraction of our sample that are low-z sources 
scattered into the color-selection window due to photo- 
metric scatter — statistically, and compare that with 
the available spectroscopic redshifts in GOODS. In Sec- 
tion[3]we describe how we measure the galaxy magnitude 
and s ize, and why we opt for using GALFIT (iPeng et al.l 
I2010D instead of SExtractor ()Bertin fc Arnoutslll996D for 
size measurement. In Section |4] we describe the size- 
luminosity distribution model we adopt, and the sim- 
ulations we perform in order to derive selection com- 
pletenesses, measurement biases, and measurement un- 
certainties in magnitude and size bins. We then ex- 
plain how we use the derived kernels to transform the 
model to produce the expected distribution of observa- 
tions, and how we perform the fit to derive the best- 
fit parameters. In Section [5] we present the results of 
the fitting together with the confidence intervals of each 
parameters. In Section [B] we discuss some implications 
of our results, including comparison with previous LF 



determinations, size evolution, width of the size distri- 
bution, and the size-luminosity relation. We summa- 
rize our work in Section [T] We adopt the cosmologi- 
cal parameters Hq = 70 kms^^ Mpc^^, Q„i ~ 0.3, and 
J7a = 0.7 for easy comparisons with previous results, and 
these values are very close to the WMAP7-|-BA0-|- i?n 
mean cosmological parameters (|Komatsu et al.l[2011[ ) of 
Ho = 70.2kms-iMpc"\ 17„ = 0.275, and = 
0.72 5. The magnitudes are in the AB magnitude sys- 
tem (|Oke fc Gunnl[T98l . 

2. LYMAN BREAK GALAXY SAMPLE SELECTION 

We select Lyman Break Galaxies using colors com- 
puted from the public GOODS ACS version 2 (v2) cat- 
alogs, excluding the HUDF region. Fo r more details of 
how th e catalogs are prepared please see lGiavalisco et ahl 
(j2004al) and the documentation on the GOODS websit(fl. 
In order to constrain the statistics at the faint end of 
the luminosity function, we also select LBGs from the 
Hubble Uhra Deep Field (HUDF; 'Beck with et "all [20061) 
dataset. HUDF dataset consists of ACS images in the 
same passbands used in the GOODS ACS dataset (B435, 
^606: *775i and zgso) covering an 11 arcmin^ area within 
the GOODS-S field. We create our own catalogs in the 
HUDF based on detection in zgso-band and photometry 
on all bands in dual-image mode of SExtractor, using 
0.5 for DETECT_THRESH and ANALYSIS_THRESH, and 0.03 
for DEBLEND_MINCONT. For detai ls on the ACS i maging 
product in the HUDF please see lBeckwith et aTl ((2006) . 

2.1. Selection Criteria 

LBG color-selection criteria are designed for selecting 
star-forming galaxies at high redshift using two features 
in the spectral e nergy distributions (SED) of star form- 
ing galaxies (e.g. iSteidel fc Hamiltonlll992l : iSteidel et al.l 
[T99alGiavaliscdl2002D . One feature is the spectral break 
at rest-frame 912 A due to the absorption of UV pho- 
tons by the stellar atmosphere of early-type stars and the 
neutral hydrogen in the interstellar medium (ISM) sur- 
rounding the star-forming regions. The second feature 
is the blanketing of Lyman-series lines by the interven- 
ing neutral hydrogen in the intergalactic medium (IGM) 
along the lines of sight to galaxies (Madau 1995; Mciksia 
12006( 1. Therefore it is possible to use the colors between a 
set of broadband filters to target a specific redshift range. 

We adopt the color-selection criteria in 
iGiavalisco et al.l (|2004bD . which are optimized to 
select LBGs at z ~ 4 and z ~ 5. The criteria for the 
^435-dropouts are 

(^435 - ^606) > 1.2 + 1.4 (^606 " ZS5o) A 

(^435 - Veoe) > 1.2 A (Veoe - zsso) < 1-2 
and the criteria for the Vgoe-dropouts are 

{[(Veoe - 4775) > 1.5 + 0.9 (1775 - ^sso)] V 

[(^eoe - «775) > 2.0]} A (^606 " ^775) > 1-2 A (2) 

(«775 - Z850) < 1-3, 

where A is the logical "AND" operator and V is the 
logical "OR" operator. The colors are computed using 

^ |http: //archive . stsci . edu/prepds/goods/| 
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the isophotal magnitudes MAG_ISO reported by SExtrac- 
tor in order to obtain consistent colors. All magnitudes 
quoted afterwards are the MAG_AUTO from SExtractor un- 
less otherwise stated. 

We select galaxies that satisfy the above color criteria 
down to ^850= 26.5 mag in GOODS and ^850= 28.5 mag 
in HUDF, while also demanding that the signal-to-noise 
ratio (S/N) in the Z85o-band be greater than 5. For Veoe- 
dropouts we also require all sources to have i3435-band 
{S/N)b < 5, as at z ^ 5 the i3435-band is entirely blue- 
ward of the Lyman break and should not have signifi- 
cant detection. This S/N criterion is more liberal than 
those in the literature (e.g. iGiavalisco et al.|[2004b ). The 
reason for adopting a liberal cut here is to increase the 
completeness of our sample, as our simulations show a 
non-negligible fraction of the true 2; ~ 5 sources whose 
S/N in i3435-band are slightly higher than 2 (the value 
commonly used in the literature) because the SExtractor- 
estimated S/N becomes unreliable for the non-detected 
sources. Among the 52 sources with 2 < S'/iV < 5 in 
i3435-band, 6 are broken-up diffraction spikes, 46 have 
S/N < 3, 8 have 3 < S/N < 4, and only 2 have 
4 < S/N < 5. Furthermore, all three of the sources with 
spectroscopic redshifts that have S/N > 2 in i?435-band 
are at Zgpec > 4.4 and not low-z interlopers. Because the 
choice in the S/N limit is photometry dependent and has 
some intrinsic uncertainty, we decide to use a more lib- 
eral threshold in the i3435-band S/N to increase our com- 
pleteness without introducing a large number of interlop- 
ers. We also exclude sources with SExtractor-reported 
CLASS_STAR > 0.95 down to Z85o< 26 mag in GOODS 
and zg,5o< 28 mag in the HUDF to eliminate field stars. 
The resultant sample still contains two AGN candidates 
(lAU ID J 033216. 86-275043.9 and J033243.84-274918.1; 
IWolf et ah 2004); we therefore exclude these two sources 
from further analyses. We next visually inspect each 
selected dropout source to remove other image artifacts 
that are not removed in the image-reduction process, and 
this removes 16 and 19 sources from the -B435-dropout 
and Vgoe-dropout sample, respectively, all of which are 
broken diffraction spikes from bright stars. 

At the bright end of our samples {zs5o< 24.5 mag in 
GOODS and 2;85o< 25.5 mag in the HUDF) the ma- 
jority of sources have spectroscopic redshifts. For the 
few sources t hat do not, al l of th em have photometric 
redshifts from' Dahlen et ahl (|2010f ) (we describe the red- 
shift information of our sample in Section 12. 2|) . There- 
fore we eliminate the bright sources that are confirmed 
low- 2; interlopers (defined as z < 3 for the i3435-dropout 
sample and z < 4 for the Veoe-dropout sample) based 
on available redshift information. Doing so removes 
1 B435-dropout and 3 Vgoe-dropouts from GOODS. In 
fainter magnitude bins (z85o> 24.5 mag in GOODS and 
Z85o> 25.5 mag in the HUDF) we do not have redshift 
information for most sources, so we will model the in- 
terloper fractions in Section 12.31 and include them as an 
additional component of our model. 

In addition, if a given source exists in both the GOODS 
and HUDF source lists, we use the values measured in 
the HUDF because it is deeper. With the above cri- 
teria we have a total of 1063 i3435-dropouts and 465 
Veoe-dropouts. The number of dropouts in the two 
GOODS fields (excluding the deeper HUDF), 384 B435- 
dropouts and 196 Veoe-dropouts in GOODS-N and 454 



^435-dropouts and 157 Vgoe-dropouts in GOODS-S, are 
consistent within the cosmic variance estimated by th e 
cosmic variance calculator of iTrenti fc Stiavellil (|20Q8D . 
which is about 15%. The numbers are summarized in 
Table [H 

2.2. Redshift Distributions 

There have been several campaigns to measure 
spectroscopic redshifts (zgpec) in the G OODS fields 
that cover the sources of i nterest here (iSteidel et al ' 
1999"; 'Cristiani et al. 2000; Cohen et al.' 2000'; 'Cohenl 
2001; Stei dcl et al. 2003; Wirth et al. 2004; Daddi et al 
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Popesso et aLii2009t Stern et al. 2012, in preparation). 
In addition. IDahlen et all (j2010D produced the deepest 
photometric-redshift (zphot) catalog in GOODS-S (the 
photometric-redshift catalog of Ca rdamone et al] 120101 
in GOODS-S, although derived with more bands, does 
not go as deep as Dahlen et al.ll2010l ) In Figure [1] we 
show the redshift distribution of our selected sources 
(we only show spectroscopic redshifts with good or 
acceptable qualities if such fiags exists in the catalogs.) 
In general the color-selection criteria are able to select 
-B435-dropouts within the redshift range 3.4 < z < 4.4 
and Vgoe-dropouts within 4.4 < z < 5.6. We do not 
use the redshift information to identify interlopers and 
exclude them from analysis in the faint end, because 
the spectroscopic redshifts are incomplete and the 
photometric redshifts have larger errors. Instead we 
try to model the interloper fractions in Section 12.31 and 
include their contribution as an additional component 
in our model. 

High-redshift, star- forming galaxies have a higher frac- 
tion of Lyct-emitt ers (LAEs) t han lo cal star-forming 
galax ies (see e.g. iReddv et al. 2008: Finkel stein et al.l 
i200alCowie et a l.ll2010t l~du llo et al. 2012). The frac- 
tion of star-forming galaxies at z > 5 with large rest- 
frame Lya equivalent widths could also be a function 
of redshift and lum inosity (see e.g. iStark et "all 120101 : 
'Schae rer et alj |2011[ ). At z - 5.6 - 5.8, Lya is red- 
shifted into the lyrs-band, therefore a galaxy with strong 
Lya emission (rest-frame equivalent width Wi^ya ^ 50 
A) will have redder Vgoe— ^775 and bluer i775-Z85o col- 
ors than the colors determined by the continuum alone, 
making them more likely to be sel ected as Vfioa-dropouts 



instead of Z775-dropo uts (s ee e.g. I Malhotra et al.l 
Dow-Hvaelund et a ll 120071: f Stanwav et all 1200 



2005 



2008 



.Bouwens et al.. 200'^lstark et al.,.2010i) . Among our Vgoe" 
dropouts with spectroscopic redshifts, 6 source out of 40 
(15%) have Zgpcc > 5.5, representing a secondary peak of 
Zgpcc distribution. It is possible that these sources have 
strong Lya-emission-perturbed broadband colors. How- 
ever, at z > 5.5 there is yet to be a complete census of 
Lya equivalent widths, so we can not model the effects of 
Lya emission on our sample selection accurately. A sim- 
ple estimate of the possible fraction of Veoe-dropouts that 
are z ~ 6 galaxies with strong Lya emission — using the 
best-fit exponential distribution o f Lya equivalent widths 
of LAEs at z - 3.1 (jCiardullo et al.'2012 ) and the num- 
ber densities of Z775-dropouts from iBouwens et al.l ()2007f ) 
— suggests that these galaxies make up at most about 
5% of our Veoe-dropout sample in GOODS. Therefore 
we do not consider z ~ 6 star-forming galaxies with 
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TABLE 1 

The number of selected dropouts. 



Dropout 


Field 


2850 limit'' 
(mag) 


Number 
(SExtractor sample) 


Number'' 
(GALFIT sample) 


Number'^ 
(GALFIT quality sample) 


-B-dropouts 


GOODS 
HUDF 


26.5 
28.5 


838 
225 


733 
202 


586 
146 


V-dropouts 


GOODS 
HUDF 


26.5 
28.5 


353 
112 


325 
96 


219 
61 



^ zgSO-band MAG_AUTO limit for sample selection. 

^ Number of sources with GALFIT-returned magnitudes (in iyrs-band for B435 -dropouts and in zgso-band for 
Veoe-dropouts) brighter than 26.5 mag in GOODS and 28.5 mag in HUDF. 
Number of sources that have acceptable GALFIT fits. 



-S 80 



B-dropouts 



spec-z 
phot-z 



V-dro pouts 




1 2 3 4 5 6 7 

redshift 

Fig. 1. — Redshift distribution of £5435 -dropouts and Veoe- 
dropouts selected from GOODS-S. The bin width o f the redshift 
axis is Az = 0.2. Photometric redshifts are from Dahlen et al.l 
1(2010). Ref erences for the spectroscopic redshift data are given in 
Section [Ol 

strong Lya emission to contribute significantly to the 
V(j06-dropout sample. 

2.3. Interloper Fractions from Photometric Scatter 

Sources other than high-z star-forming galaxies could 
also be selected by the LBG criteria. One type of in- 
terlopers are sources with intrinsically red SEDs such as 
dwarf stars (of spectr al types M, L , and T) and z < 2 
quiescent galaxies (iRvajTet al.ll2005HPirzkal et al.l[2005L 
12001 ISteidel et al.l 119991: lYoshida et al. II2006D. Another 
type of interlopers are sources with intrinsic SEDs out- 
side of the target range, but with colors close enough 
to the selection boundaries that they are scattered into 
the color-selection window due to photometric error. To 
minimize the first type of interlopers, we identify low- 
z interlopers with available redshift information in the 
bright end in Section 12.11 At the faint end, identifying 
individual interlopers of either type becomes intractable 
due to the limited spectroscopic information. In this sec- 
tion, we estimate the interloper contributions from pho- 
tometric scatter (hereafter "photometric interlopers"). 

We estimate the photometric interloper fractions at 
Z85o> 24.5 mag in GOODS and Z850> 25.5 mag in the 
HUDF, in magnitude bins of width 0.5 mag, with a set of 
simulations (the "photometric interloper simulations"). 
Our simulations are v ery similar to the ones performed in 
iBouwens et al.l (|2006[ ) (where they estimated the cont am- 
ination fraction in the HUDF) and llwata et al.l (|2007[ ). In 
the photometric interloper simulations we use all sources 
with S/N > 5 in zgso-band and S/N > 3 in B435-, Vgoe-, 
and i775-bands that are not selected as dropouts. Only 



TABLE 2 

The photometric interloper fractions at zg5o> 24.5 mag 

IN GOODS AND 2850> 25.5 MAG IN THE HUDF FROM THE 
PHOTOMETRIC INTERLOPER SIMULATIONS 



Dropout 


Field 


-^interloper 


-^dropout 


/interloper 


i3435-dropouts 


GOODS 


35.1 


821 


4.3% 


^435-dropouts 


HUDF 


15.8 


223 


7.1% 


Veoe-dropouts 


GOODS 


22.9 


352 


6.5% 


Veoe-dropouts 


HUDF 


0.9 


111 


0.8% 



Note. — A'jnterloper is the expected number of photometric 
interlopers from simulations, while A^dropout is the number of 
real LBGs selected in the given magnitude range. 



sources with either spectroscopic redshift or photomet- 
ric redshift much lower than z — 3 are used in order to 
minimize the contribution of sources that are just be- 
low the selection redshift range. The sources selected 
this way have reasonably constrained SED shapes that 
span a wide range. For each selected source we then 
rescale its zsso-band magnitude to a randomly selected 
value between 24.5 and 26.5 mag (in GOODS) and be- 
tween 25.5 and 28.5 mag (in HUDF). After rescaling the 
magnitudes in other bands by the same factor, we ran- 
domly perturb its magnitude in each band according to 
the appropriate photometric error in the given band and 
magnitude bin (assuming Gaussian error in magnitude). 
We calculate the number of these sources that fall within 
the color-selection window in each magnitude bin, and 
also calculate their fractional contribution among the real 
dropouts, defined as 



/i 



interloper 



num. of expected photometric interlopers 
num. of selected dropouts 

The same procedure is repeated 5000 times to collect 
good statistics. 

The results of the photometric interloper simulations 
are presented in Figure [2] The estimated overall pho- 
tometric interloper fractions are below 8%, and for the 
Vfjoe-dropouts in the HUDF the photometric interloper 
fraction is especially low, possibly owing to the extremely 
deep i775-band data in HUDF that constrain the 1775- 
zgso color well. In our photometric-interloper simula- 
tions, we do not reject sources using the i3435-band S/N 
threshold for Vgoe-dropouts, so the true photometric in- 
terloper fractions are likely even lower. In general, the 
photometric-interloper fractions stay relatively constant 
over all magnitude bins probed. 

As verification of the photometric interloper fractions 
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Fig. 2. — The estimated interloper fractions due to photomet- 
ric scatter as a function of the zgso-band magnitude. The top 
panel shows the results in GOODS-depth images; the bottom panel 
shows the result in the HUDF-depth images. Black histograms and 
lines are for B435-dropouts; gray histograms and lines are for Vsos- 
dropouts. Solid histograms are the observed number of dropouts; 
dashed histograms are the expected number of interlopers; dotted 
lines are the estimated interloper fraction in each magnitude bin. 
In most bins the interloper fractions are below 10%; these fractions 
are likely upper limits because sources that are scattered into the 
color-selection window could still be rejected using other criteria 
that cull genuine high-z galaxies. The numbers (of sources in each 
bin) are read off from the right ordinate, and the interloper frac- 
tions are read off from the loft ordinate. In bins where no real 
dropouts are selected, the interloper fractions are set to zero, and 
no additional number densities from photometric interlopers are 
added to the model. 

from simulations, we compare the estimated photometric 
interloper fractions /interloper with those estimated from 
available spectroscopic redshifts (zgpec) alone. Among 
the i?435-dropout sample in GOODS, 90 sources have 
secure spectroscopic redshifts at 2;850> 24.5 mag, 1 of 
them have Zspoc < 3 (1.1%). Similarly, among the Veoe- 
dropout sample in GOODS, 37 sources have secure spec- 
troscopic redshifts at Z850> 24.5 mag, and none of them 



have 



'Spec 



< 4. In the HUDF, only 4 i3435-dropouts and 



4 Vgoe-dropouts with z^^q> 25.5 mag have 



■'Spec ; 



and 



none of them are low-z interlopers. All of the interloper 
fractions estimated from available spectroscopic redshifts 
are consistent with what we find from the photometric 
interloper simulations. In all cases, the estimated inter- 
loper fractions from the simulations are higher than those 
estimated from the sources with spectroscopic redshifts. 
Because this set of simulations accounts for objects scat- 
tered into the color-selection window, but does not use 
other criteria such as stellarity or image flags to reject 
sources, the interloper fractions estimated are likely up- 
per limits. 

3. MAGNITUDE AND SIZE MEASUREMENT 

Defining and measuring the size of a galaxy from the 
image is a long-standing problem. In the past there have 
been several definitions of galaxy size, all with the general 
aim of being unbiased and easy-to-measure. For exam- 
ple, the Petrosian radius (|Petrosian.,1976; .Graham et all 



120051 and references therein) is defined through the Pet- 
rosian index r7(i?) = {I)n/I{R), the average surface 
brightness within radius R over the surface brightness 
at R. Petrosian radius is then defined as a multiple of 
Rp at which r]{Rp) (or l/r]{Rp)) equals some chosen 
value. In practice, one cannot measure I{R), but in- 
stead one measures (/(i?i, i?2)), the mean surface bright- 
ness in some range of radius, which is subject to resolu- 
tion effects at small R and S/N limitations at large R. 
Another co mmon definition is the half-light radius (e.g. 
IBershad"v~e t al. 2000) that is the radius within which half 
of the total light from the galaxy is enclosed. However it 
is sensitive to the aperture adopted to measure the total 
light from the galaxy. On the other hand, the effective 
half-light radius Re , the radius that encloses half the to- 
tal light of the Ser s ic light profile (Scrsic 196 ^ see also 
iTruiillo et al][200TI: iRavindranath et all |2004 l2006l ). is 
independent of the aperture adopted, although it is sub- 
ject to the parametric assumption of the Sersic profile. 

In this work we adopt the effective half-light radius Re 
of a Sersic profile along the semi-major axis (which en- 
closes half the total flux) as our definition of galaxy size. 
The Sersic profile can be expressed in the following equa- 
tion: 



I{R) = leexp < ~b„ 



l/n 



(3) 



where Re is the effective half-light radius within which 
half the total flux is enclosed, /g is the surface brightness 
at the effective half-light radius, n is the Sersic index 
which characterizes the overall profile shape, and &„ is the 
normalization constant that ensures that i?e encloses half 
the t otal fiux (see also Graham et alj l2005t iRvan et al.l 
l2010f) . Several familiar light profiles such as Gaussian 
profile (n = 0.5), exponential profile ( n = 1), and the 
classi cal de Vaucouleurs profile (n = 4; Ide Vaucouleurd 
Il948f l are special cases of the Sersic profile. Using a para- 
metric form to estimate galaxy size has the advantage 
of being independent of the choice of aperture, taking 
into account the effect of the PSF, and allowing one to 
weigh individual pixels according to their S/N , resulting 
in more optimal photometry. However it is subject to the 
assumption of profile shape, which may not always be a 
good description of real galaxies. We will show through 
simulations that using the Sersic profile-defined galaxy 
size is less biased than the SExtractor-measured half- 
light radius in SectionlH at least for galaxies with smo oth 
profiles. We choose GALFIT v3 (IPeng et al.l[20Toh for 
size and magnitude measurements. GALFIT is a 2D 
galaxy light-profile fitting software that derives the best- 
fit parameters of the input functional form that matches 
the observed galaxy light distribution. GALFIT has been 
demonstrated to give reasonably unbiased size and mag- 
nitude measurements down to the signal-to-noise ratio 
range typical for h igh- z galaxies (|Haussler et al.l [20071 : 
[Ravindran ath et al.lf2006l ). 

The summary of our procedure is as follows: 

• We only use one Sersic component to fit each 
galaxy. Althoug h several obs e rvational studies 
(e.g. [Ra vindranath et all 120061: iLaw et al.l 120071: 
iGuo et al.li2012i) . a s well as stu dies of galax y evo- 
lution models (e.g. iDekel et"alli2009; .Agertz et al.l 
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I2009D . have shown that a fraction of high-z star- 
forming galaxies are clumpy, we do not use multiple 
Sersic components to obtain better fits. Additional 
Sersic components make it much harder to model 
the systematic effects, and they do not help us gain 
more insight on the true size of the galaxies. 

• We use GALFIT to measure the size and magni- 
tude in the passband that is closest to rest-frame 
1500A in the UV. This means we use the observed 
i775-band for i3435-dropouts and the observed zsso- 
band for Vgoe-dropouts. 

• We adopt SExtractor attributes of each dropout 
galaxy as initial guesses in GALFIT. For example, 
we take SExtractor MAG_AUTO as the initial guess 
of total magnitude and half-light radius as the ini- 
tial guess of Re- GALFIT is robust enough that 
it is not very sensitive to the initial guesses as 
long as they are reaso nably close to the true values 
([Htussler et al.|[2007l ). 

• We make cutouts around every dropout galaxy and 
run GALFIT on the cutout images. We expand the 
cutout image to include close neighbors as well as 
bright neighbors that could perturb the fitting re- 
sult of the main galaxy. Neighbors are then mod- 
eled as separate Sersic components. GALFIT is 
able to fit multiple Sersic components simultane- 
ously, so including the close neighbors helps GAL- 
FIT obtain more reasonable fits for the objects of 
interest. 

• We do not use the dropouts for which GALFIT fails 
to converge or returns magnitudes fainter than 26.5 
mag in GOODS and 28.5 mag in the HUDF. There- 
fore we do not include the 128 B435-dropouts and 
the 44 Vgoe-dropouts that have GALFIT-returned 
magnitudes fainter than the above limits. We also 
only include galaxies with acceptable GALFIT re- 
sults. The fitting quality is determined by both the 
ratio between i?e-error and R,, {[Re\err / Re) and 
the reduced chi-square (x^) reported by GALFIT. 
Therefore we do not include dropouts that have ei- 
ther [Rf.]err/Re > 0.6 Or Z775-band > 0.4, and 
0850-band > 0-5 in the GOODS images, and ei- 
ther [Re\err/Re > 0.6 Or xj > 5.0 (in both Z775 

and Z85o) in the HUDF images in the fitting pro- 
cess. (The difference in the values of x^ adopted for 
GOODS and HUDF images likely originates from 
the different levels of correlated noise between the 
two datasets. The median x^ for GOODS and 
HUDF are 0.33 and 1.22 as derived from simula- 
tions, respectively, and the Xy limits correspond 
roughly to the 90% limit in both fields.) Doing 
so rejects another 203 i?435-dropouts and 141 Vgoe- 
dropouts, leaving us with 732 _B435-dropouts and 
280 Vfjoe-dropouts in the final fitting. 

Let us look more carefully at the GALFIT complete- 
ness, or the fraction of sources for which GALFIT is able 
to return good fits. Clearly, at some level, the fact that 
real LBGs do not always have smooth Sersic-profile mor- 
phologies must be contributing to the failure rate. How- 
ever, as described below, we find that the failure rate in 



our smooth-profile simulations is very similar to the fail- 
ure rate for the real galaxies. So we do not think that the 
morphology difference is the primary cause of GALFIT 
failures. Instead, we attribute the failure rate to a vari- 
ety of other factors, such as the infiuence of neighbors, 
or problems with centering or background subtraction. 
Examination of individual failures in the simulations did 
not reveal obvious trends that we could easily address by 
tweaking the GALFIT parameters or the fitting proce- 
dure. 

In order to understand how well we understand the 
GALFIT completeness of our sample, and the system- 
atic effects it might have on our number counts, we ex- 
amine the failure rates for the simulations described in 
Section 14.31 In Figure [3] the number of i3435-dropouts 
with good fits in bins of GALFIT-measured magnitude 
and size as well as the expected number of good fits de- 
termined from our simulations. The left-hand column 
shows the numbers for the GOODS-depth data (top left 
is for magnitude bins and bottom left is for size bins); 
the right-hand column shows the numbers for the HUDF- 
depth data (top right is for magnitude bins and bottom 
right is for size bins). The solid lines show the numbers 
of i?435-dropouts in each bin; the dashed lines show the 
expected numbers based on the GALFIT completenesses 
from simulations. We calculate the expected numbers 
of galaxies with good fits by multiplying the number of 
galaxies in each (GALFIT-measured) magnitude-size bin 
by the GALFIT completeness (derived from simulations) 
in that bin. Then the total number in a magnitude bin 
is the sum of the numbers in all bins with the same mag- 
nitude but different sizes (the numbers in size bins is 
calculated in a similar way). The error bars correspond 
to Poisson errors (a/ZV). One can see that in most bins 
the numbers of observed i3435-dropouts with good fits are 
consistent with the expected numbers of good fits from 
simulations within the errors. However, in the GOODS- 
depth data there is a slight systematic offset that the 
observed numbers of _B435-dropouts with good fits are 
higher than those expected from simulations, for bins 
with magnitude fainter than 25.5 mag and log]^g(i?e) be- 
tween 0.4 and 1 (where Re is in pixels); in those bins, 
the differences in the numbers is ~ 10% of the numbers 
of -B435-dropouts with good fits. The same offsets are 
not observed for the HUDF-depth data. It is unclear 
how much systematic error the offsets introduce into our 
best-fit size-luminosity distribution, but we do not think 
that it drastically changes the main conclusions of this 
paper. 

It appears that fitting i3435-dropouts and Veoe- 
dropouts is really pushing the limits of GALFIT. Be- 
cause we model the completeness due to GALFIT mea- 
surements also in magnitude and size bins, these effects 
will be included in our transformations and will also be 
reflected in the final results. 

The GALFIT-measured effective half-light radii and 
magnitudes for our _B-dropout and ^-dropout samples, 
excluding poor fits, are shown in Figure|4l Some objects 
have very small best-fit Re, in some cases i?e < 1 pixel. 
Most of them lie very close to the selection limit in the 
given dataset (2850= 26.5 mag in GOODS and ^850= 28.5 
mag in HUDF) where rejection using CLASS_STAR is not 
reliable, and from visual inspection all of them are unre- 
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Fig. 3. — The number of sources for which GALFIT is able 
to return good fits in bins of GALFIT-measured magnitude (top 
row) and size (bottom row) for i3435-dropouts. The numbers for 
GOODS-depth data are shown in the left column; the numbers 
of HUDF-depth data are shown in the right column. The solid 
lines show the numbers of i?435-dropouts; the dashed lines show 
the numbers expected from simulations. The expected numbers of 
galaxies with good fits (dashed lines) are calculated by first multi- 
plying the numbers of i3435-dropouts (whether they have good fits 
or not) in each magnitude-size bin by the GALFIT completeness 
of that bin, and then they are marginalize over size (or magni- 
tude) to obtain the expected number of good fits in each magni- 
tude (or size) bin. In each panel we also show the Poisson errors 
of _B435-dropouts. In most magnitude or size bins, the expected 
numbers from simulations are consistent with the numbers of B435- 
dropouts within Poisson errors; however, for the GOODS-depth 
data the observed numbers are systematically higher than the ex- 
pected numbers from simulations for magnitudes fainter than 25.5 
mag and logj^f)(i?e) between 0.4 and 1. In these bins the differ- 
ences are ~ 10% of the numbers of iJ435-dropouts with good fits. 
On the other hand, no clear systematic offsets are observed for the 
HUDF-depth data. The plots for Veoe -dropouts (not shown here) 
are qualitatively similar and there are no clear systematic offsets 
for Veoe -dropouts between the expected and the observed numbers 
with good fits. 
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Fig. 4. — GALFIT-measured effective half-light radii and mag- 
nitudes of the i3435- and Vgoe-dropout galaxies. Only the sources 
whose GALFIT results satisfy the criteria described in Section [3] 
are shown. Dark points are dropouts from the GOODS dataset; 
gray crosses are dropouts from the HUDF dataset. The pixel scale 
is 0.03" . The median of the magnitude errors reported by GALFIT 
of the B435-dropouts selected from GOODS and HUDF are 0.31 
and 0.20 mag, respectively, while the median errors in Re (returned 
by GALFIT) for B435-dropouts selected from GOODS and HUDF 
are 0.5 and 0.3 pixels, respectively. Similarly, the median magni- 
tude errors for Vsos-dropouts selected from GOODS and HUDF 
are 0.55 and 0.42 mag, respectively, while the median errors in Re 
for Vfioe-dropouts selected from GOODS and HUDF are 1.4 and 
0.8 pixels, respectively. However, the errors are a strong function 
of surface brightness, so the above errors are not representative of 
all galaxies in our sample. We will derive the measurement errors 
in each magnitude-and-size bins with simulations in Section 14.31 
Please see the online version for the color figure. 



tected in HUDF, we use the measurements in the HUDF. 
We compare the measurements of these sources from the 
GOODS images and the HUDF images in Figure [6l 14 
i?435-dropouts and 3 Vgoe-dropouts are detected in both 
images, among them GALFIT failed to return any resuh 
for 1 i?435-dropout and did not return a good fit for 5 
^435-dropouts. For the dropouts that return good fits, 
there is good agreement between the two measurements. 
The GALFIT-measured sizes ah he within ±0.1 dex from 
each other except for one source. 



solved. We exclude known stars and AGNs in the bright 
end to the best of our knowledge, and some bright sources 
are genuine compact, high-redshift sources. One exam- 
ple is GOODS J033218.92-275302.7 in the V'eoe-dropout 
sample that has = 24.43 mag and Re — 0.0144" (or 
0.48 pixel). This o bject has measured s pectroscopic red- 
shift Zspec = 5.563 (iVanzella et al.|[2009[ ) and is one of the 
brightest and the most compact galaxies in our sample. 

The distribution of GALFIT-measured Sersic index 
for our samples is shown in Figure [SJ It is custom- 
ary to classify the morphology of galaxies using the 
best-fit Sersic index n: galaxies with low n (usually 
n < 2.5) are classified as "late-type", or "disk-like", 
while galaxies with high n (usually n > 2.5) are clas- 
sified as "early-type" or "bulge-like". Using this defini- 
tion, among our i3435-dropout sample 78% is classified 
as late-type, while among our Veoe-dropout sample 72% 
is classified as late-type. This is cons istent with earlier 
studies (e.g. iRavindranath et al.ll2006l ) that the majority 
of the LBGs have Sersic indices of late-type (or disk-like) 
galaxies. Since we do not constrain the Sersic index, a 
small fraction of objects have very large Sersic indices. 
The fraction of objects with extremely large n {n > 8) 
is small — only about 3% — and should not affect our 
results dramatically. 

When a given source detected in GOODS is also de- 



4. MODELING AND FITTING PROCEDURES 

In this section we describe the procedure to build the 
model size-luminosity distribution in detail and much of 
this section is rather technical. Here we highlight that 
the new feature of our work is to account for system- 
atic effects of dropout selection and size measurement in 
magnitude and size bins. The steps involve: 

1. constructing a theoretical 2D model that combines 
the luminosity function and size distribution; 

2. transforming the distribution from step [T] using the 
dropout-selection completenesses and redshift dis- 
tributions (derived in each intrinsic absolute mag- 
nitude and effective half-light radius bin) to ac- 
count for sample selection effects; 

3. transforming the distribution from step [5] using 
the GALFIT-measurement transfer function ker- 
nels (derived in apparent magnitude and effective 
radius bins) to account for measurement bias and 
uncertainties; 

4. using the transformed model from step [3] to com- 
pare with the observed size-luminosity distribution 
and derive the best-fit parameters. 
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Fig. 5. — The distribution of GALFIT-measured Sersic index 
n. The distribution of i3435-dropouts is shown in dark soUd his- 
togram, while the distribution of Veo6 -dropouts is shown in gray 
dashed histogram. The fraction of late-type galaxies (defined as 
those having n < 2.5) is 78% among B435-dropouts and is 72% 
among Vgoe-dropouts. This motivates us to use 70% exponential- 
profile and 30% de Vauc ouleurs-profile objects in the simulations 
described in Section 14.31 



4.1. Bivariate Size- Luminosity Distribution 

In this study we combine the UV luminosity lunc- 
tion and size distribution to form the bivariate size- 
luminosity distribution in rest-frame 1 500A. We adopt 
the Schechter function ()Schechteiill976[) as our model lu- 
minosity function, as is customary in most high-z UV 
LF studies and was shown to be a decent parametriza- 
tion (e.g. ISt eidel et al] 119991: IBouwens et"all 120071: 



IR,eddv fc SteideL,2009il : 



exp 



-) - (4) 

L* J L* ^ ' 



where (f){L)dL is the volume number density of galaxies 
within the luminosity range (L, L + dL), L* is the char- 
acteristic luminosity, and a is the power-law slope at the 
faint end, and (j)* is the normalization. 

For the size distribution of LBGs at a given luminosity, 
we adopt the log-normal distribution: 



p{Re)dRe 



1 



: exp 



ln^(i?e/fle) 



(5) 



where p{Re)dRe is the probability density that a galaxy 
at luminosity L has effective half-light radius between 
(i?e; Re + dRe), and Re, (Tin Re ^rc the peak and width 
of the distribution, respectively. The log-normal form 
of size distribution is motivated by the disk-formation 
theory of Fall & Efstathiou (1980), in which gas approx- 
imately conserves its specific angular momentum when 
it forms in the centers of dark matter halos, and disk 
size is proportional to the spin parameter A and the halo 
virial radius i?vir: Rd oc Ai?vir- In this picture, the 
size distribution of the disks should reflect the under- 
lying A distribution of the halos, which is found to be 
well approximated by a log-normal distrib ution indepen- 
dent of redshift or halo mass (Barnes & Efstathioulll987l : 
iWarren et all [19911 Bullock et al...2001.) . 

To co nnect equations (ffl and (El), we fo llow the argu- 
ments of' de Jong fc LacevI (|2000[ ) (see also iCholoniewskl 
[l985.) that adopts a power-law relation to connect the 
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Fig. 6. — Comparison between the GALFIT-measured size (left 
panel) and magnitude (right panel) in GOODS and HUDF for 
dropouts detected in both images. The dots are the iJ435-dropouts 
and the crosses are the Veoe-dropouts. The sizes measured in both 
images generally agree within 0.1 dex. 



luminosity L and the peak of the size distribution R: 



Re{L) — Rf) 



(6) 



The peak of the size distribution .R at a nominal lumi- 
nosity Lq is defined as .Ro = Re{Lo)- We choose Lq to 
correspond to Mq = —21, or = 9.12 x 10~^^ erg s~"^ 
Hz^i at 1500 A. Using Eq. \^to connect Eq. [Hand Eq. 
[HI and expressing the distribution in terms of absolute 
magnitude M, we come to the full bivariate expression: 

^{M, logio Re)dM dlogio Re = 0.41n(10)x 



2Q-0.4(q+1)(M-A/*) 



exp 



40 



-0.4(M-M*) 



In(lO) 



dMdlogio R, 



exp • 



[log,o{Re/Ro)+0.mM-Mo)] 
2<i?yin'(10) 



. (7) 

In Eq. [3 the bivariate distribution is characterized by 
a set of five free parameters P = {a, M* , Rq, ain r^, (3) 
(again we determine 0* after we determine the shape of 
the distribution). We show in Figure[7|the distribution in 
the (M, log^Q Re) plane corresponding to the fiducial pa- 
rameters Po = (-1.7, -21.0, 0.21", 0.7, 0.3) down to ap- 
parent magnitude of 26.5 mag. The regions with brighter 
shades in Figure [3 denote higher number densities. We 
choose the values Pq because these are the initial guesses 
of the parameters in the fitting process. If the intrinsic 
distribution of the galaxy population is indeed described 

by Eq. [Tjwith Pq, one would expect the observed points 
to follow the top panel of Figure [71 if the sample is com- 
plete and the measurement error is zero. However there 
are always sample incompleteness and there are always 
measurement errors. In the following sections we de- 
tail the procedure we use to incorporate sample incom- 
pletenesses and measurement biases and errors into our 
model. The transformed distribution is shown in the bot- 
tom panel of Figure [71 which is the expected distribution 
if the intrinsic distribution is characterized by Pq- We 
compare the transformed distribution with the observed 
distribution to derive the best-fit parameters. 
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Fig. 7. — This is a visual representation of the size-luminosity 
model before and after the transformation. The model shown 
here is parameterized by equation [7] with parameters P = 
(a,M*,Ro,(TinR^,l3) = (-1.7, -21.0, 0.21", 0.7, 0.3) down to m = 
26.5 mag — the adopted magnitude limit of the sample from 
GOODS. The locations with brighter shades denote higher number 
densities. We choose the values Pq because these are the initial 
guesses of the parameters in the fitting process and they are close 
to the best-fit parameters found later. The top panel shows the 
model distribution without the corrections of selection effects and 
measurement biases and uncertainties. The bottom panel shows 
the distribution after accounting for the aforementioned system- 
atic effects derived from the GOODS-depth i775-band image. The 
mathematical tr ansf orm ation s applied will be described in more 
detail in Section 14.21 and 14.31 

4.2. LBG Selection Function and Dropout Simulations 
4.2.1. Description of Simulation 

The first set of simulations ("dropout simulations") 
are Monte Carlo simulations in which we insert arti- 
ficial galaxies in the images and try to select them 
in our LBG sample. We make usual assumptions on 
the SED of the input s ources in order t o mimic the 
high-z LBG population ^.^^ '2005': 'Verm a et al 



, LBG population . . 

2007HPentericci et al.ll2007l:lYabe et a l. 2009; Star k et al 



2009 ), similar t o the o nes used in the simulations in 
Giavalisco et al.l (|2004b( ). However we still describe the 



simulations in detail here: the inp i it SE P is taken from 
the model of IBruzual &: Chariot! (j2003D with constant 
star-formation history of duration 140 Myr; the rest- 
frame 1500 A absolute magnitudes are drawn from a 
flat distribution between —25 and —15 mag; the red- 
shift is drawn from a flat distribution between 3 and 6; 
the dust, parameterized by E(B — V), is drawn from a 
Gaussian distribution with peak 0.15 and dispersion 0.05. 
These values correspond to the range of rest-frame UV 
slope (/3) between —2.3 and —0.8, cons istent with the 
measured range in the litera t ure fsee e.g. IBouwens et al.l 
[20091: iFinkelstein et al.ll20l2l:|Castellano et al.ll2012D . We 



use the iCalzetti et al.l ( 20001 ) ext inction law to attenu- 
ate the spectrum; we also use the iMeiksin] ()2006D recipe 
to model cosmic neutral-hydrogen attenuations. The 



neutral- hydrogen Lyman-series optical depths of lMeiksIn 
( 2006 ) are smaller than the earlier estimates from lMadau 
(|1995D . but for our filter set and redshift of interest the 
differences in color corrections are fairly small. Only at 
the high-redshift end of the Veoe-dropout sample do the 
two corrections differ appreciably, with the corrected col- 
ors using[Madau (1995) redder by ^ 0.2 mag. We expect 
that adopting , Meiksin. ((2006i) will result in lower selection 
completeness for the Veoe-dropout sample, but a detailed 
comparison is outside the scope of this work. With the 
corrected SED, we calculate the magnitudes in ^435-, 
^606": *775-, and Z850-bands as our input values. 

We use the IRAF package artdata to generate images 
of artificial galaxies and insert 40 galaxies at a time into a 
4096-by-4096-pixel cutout. The input magnitudes range 
from 22 to 27 mag for the GOODS fields, and 24 to 29 
mag for the HUDF. The input effective half-light radii 
range from 0.05" to 1.5". Each object can have one of 
two profile shapes — the exponential profile and de Vau- 
couleurs profile — and we assign each object a position 
in the image and position angle drawn randomly from a 
flat distribution. We draw an axial ratio f or each source 
from the model distribution in Ferguso n et al.l (|2004[ ) 
that match the observed distribution of their z ~ 4 sam- 
ple. We use 70% e xponential and 30% de Va ucouleurs 
profile as observed in lRavindranath et al.l ()2006D (see also 
Figure [5]) . In order to properly populate the parameter 
space in magnitude, effective half-light radii, and red- 
shift, we iteratively run the simulations until we have 
collected at least 500,000 sources. We derive one set 
of dropout-selection function P{M, Re, z) from the out- 
put simulation catalog for each dropout sample in each 
field; i.e., we have two sets of P{M, R^, z) for the i?435- 
dropout sample (one for the GOODS-depth images and 
one for the HUDF-depth images) and two sets for the 
Vfjoe-dropout sample. 

The dropout simulations involve many assumptions on 
the intrinsic SED (e.g. dust attenuation) and the struc- 
tural parameters (e.g. profile shape and axial ratio) of 
galaxies representative of the LBG population. Because 
we do not know the intrinsic distributions of the prop- 
erties of the entire population, various assumptions are 
inevitable. These assumptions can only be verified with 
better constraints on the physical properties of LBGs in 
the future. 

4.2.2. Calculating Selection Completeness 

The dropout simulations quantify the LBG selection 
function P(M, Re, z) within the absolute- magnitude bin 
(Af, M + AM), effective-radius bin (log^o Re, logio Re + 
Alogj^Q Re), and redshift bin {z,z + Az). By definition 
in each bin of (M, Re,z), 

„ , # of detected and selected sources 

P{M,Re,z) = ^ . 8 

total if of mput sources 

In other words, P{M, Re, z) is the probability that an 
LBG in the given {M, Re, z) bin is detected in the image 
and selected in our sample. We adopt narrow bin sizes of 
(AM, Alogioi?e, Az) = (0.5,0.2,0.1) so that the result 
is insensitive to the input distribution in either M, Re, or 
z (as long as the input distribution is not pathological). 

Figure [8] illustrates the dependence of the dropout- 
selection function P{M, Re, z) on absolute magnitude M 
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Redshift Redshift 

Fig. 8. — The dropout-selection functions P{M, Re, z) for B435- 
dropouts (top row) and for Vgoe -dropouts (bottom row) derived 
from the GOODS-depth images. In both rows, the function 
P{M, Re, z) are shown in different M bins (the left column) and 



in different 



1 Re 



bins (the right column). In the left column 
, Re < 0.8 (Re is in pixels of 0.03"), corre- 
sponding to roughly 0.12" < Re < 0.19". In the right column all 
bins have the same absolute magnitude —21 < M < —20.5. All 
absolute magnitudes and effective half-light radii are at rest-frame 
1500 A. The lower limits in absolute magnitude or effective half- 
light radius for each curve is indicated in the legend. The units of 
effective half-light radius is pixels (with scale 0.03"). 

and effective half-light radius derived froni the simu- 
lations in the GOODS-depth images. The top row shows 
the P{M, Re, z) for the i3435-dropout selection; the bot- 
tom row shows the same for the Vgoe-dropout selection. 
For both i3435-dropouts and Vgoe-dropouts, the selection 
function is skewed more towards the low-redshift end 
in fainter bins as well as in larger-effective-radius bins, 
showing that the redshift distribution of the dropout 
sample depends on both absolute magnitude and effec- 
tive half-light radius. The plots for the HUDF-depth im- 
ages are qualitatively similar, but they extend to fainter 
absolute magnitudes. 

4.2.3. Applying P{M, Re, z) 

As the first step towards building the expected ob- 
served size-luminosity distribution, we incorporate the 
dropout-selection incompleteness into equation [7] using 
P{M,Re,z). We calculate the expected distribution 9 
with the dropout-selection completenesses as 



{m,Re) 



*(A/, Re) P{M, Re, z) — SndMdz 



dz 



Sm = S [M - DM(z) - K-m] 



(9) 
(10) 



where dV/dz is the volume element contained in the sur- 
veyed area and redshift slice {z,z + dz), and Sm is the 
Dirac delta function that facilitates the transformation 
from 1500 A absolute magnitude M to apparent magni- 
tude m. DM(z) is the distance modulus to redshift z and 
K is the fc-correction term that converts from 1500 A to 
the observed bandpass. The integration limit in absolute 
magnitude Miim is chosen to include the faintest dropout 
candidate possible that could be selected in our sample. 
Although we integrate over all redshifts, P{M, Re, z) is 
only non-zero within the redshift range targeted by the 



selection criteria. By multiplying the volume element 
dV/dz, the distribution 6 predicts the number counts in 
the (rn, logiQ Re) plane. 

When implementing the correction, the integration be- 
comes a discrete sum over small bins of {M,Re,z) be- 
cause we only have a discrete sampling of P{M, Re, z). 
Therefore we define the bivariate model ^E" on a discrete 
grid with {dM , dlog^^ Re) = (0.02,0.02). Together with 
P{M,R,ez) defined in bins of (AM, A log^ ^e, A2) = 
(0.5,0.2,0.1), we calculate the distribution 9 also on a 
discrete grid with (dm, dlogm i?e) = (0.02,0.02). The 
details of such calculations are elaborated on in Appendix 

EH 

4.3. GALFIT Measurement Biases, Uncertainties, and 
Completenesses 

4.3.1. Description of GALFIT Simulation 

The second set of Monte Carlo simulations ( "GALFIT 
simulations" ) quantifies the measurement biases and un- 
certainties of magnitude and effective half-light radius. 
In this set of simulations, we again insert artificial galax- 
ies using IRAF package artdata with the same distribu- 
tions of magnitude and structural parameters (i?e, po- 
sition angle, ellipticity, etc.) as the dropout simulations 
fSection 14. 2p . We calculate transfer function kernels in 
apparent-magnitude and effective-radius bins of width 
(Am, A logiQ i?e) = (0.5,0.2) and assume that the ker- 
nel is unchanged within the bin, and we believe that the 
chosen bin widths are narrow enough so that the derived 
kernels should be relatively insensitive to the input mag- 
nitude and size distribution. During the simulations we 
run SExtractor to detect the artificial sources and then 
use GALFIT to measure their magnitudes and effective 
half-light radii. Because we only care about how GAL- 
FIT performs in the passband closest to rest- frame 1500 
A, we only collect results in the lyrs-band for the i?435- 
dropout sample and in the zgso-band for the Veoe-dropout 
sample. We run the process iteratively until we collect 
at least 100,000 objects in each band. 

4.3.2. GALFIT Transfer Function Kernels 

Using the output simulation catalog, we calculate the 
differences between the input and output magnitudes 
and effective half-light radii for all sources. We denote 
the input magnitude and effective half-light radius as m 
and Re, and the output (measured) values as m' and 
R'e- The differences between the input and the measured 
magnitudes and sizes are defined as 5m = m' — m and 
(51og^Qi?e = logigi?g — logiQi?e, respectively. Then in 
the i'-th magnitude bin (in which m^' < m < m^' -I- Am) 
and the j'-th size bin (in which (logj^Q Re)j' < logio -^e < 
(logj^o Re)j' + A logj^g Re), we calculate the transfer func- 
tion kernel T^F and completeness C^f, as 



T^]^ = V{6m,6\og,oRe) 



r'GF _ 
^i'j' - 



J J ViSm,Slog,oRe) 



(11) 

dSmdSlogigRe (12) 



where V {6m, 6 logiQ Re) is the probability density func- 
tion (PDF) in the plane of {Sm, Slogi^Re). The 
GALFIT-measurement completeness C^J, accounts for 
the fact that GALFIT does not always give satisfac- 
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Fig. 9.— The GALFIT (the left half) and SExtractor (the right 
half) measurement biases and uncertainties in different apparent- 
magnitude and effective-radius bins, represented by the probability 
distributions (PDF) in Sm (measured magnitude — input magni- 
tude) and S log Re (measured log Re — input l og Re ) ■ The PDFs 
(equivalent to the kernels described in Section 14. 3.21 1 are derived 
from the GOODS-depth i775-band images. The input magnitude 
range of each bin is in the lower left corner, while the input size 
range is in the upper left corner. The contours represent 90%, 50% 
and 10% of the peak value of the distribution. The panels on the 
right are for fainter magnitude bins, and the panels closer to the 
top are for larger-size bins. The plus sign denotes the point where 
(5m,51ogio Re) = (0,0). 

tory fits. (Here C^J, does not include the SExtractor- 
detection incompleteness.) We use the same criteria as 
described in Section [3] to calculate the fraction of sources 
poorly fit by GALFIT, namely those with GALFIT- 
returned errors [Re]err/ Re > 0.6 and xt > OA in the 
GOODS-depth images (and xl > 0-5 in zsso-band), and 
[Re]err/Re > 0.6 and xl > 5.0 in the HUDF-depth im- 
ages in both 1775- and zgso-band. 

The transfer function kernels T9F in different bins are 
shown in Figure 13 The panels on the lower left have 
high surface brightnesses, while the panels on the upper 
right have low surface brightnesses. In general surface 
brightness has an important effect on the behavior of 
the GALFIT measurement, namely kernels in low-surface 
brightness bins are more spread out (larger uncertainties) 
and center farther away from the origin (larger biases). 
The errors in the measured magnitudes and effective half- 
light radii are correlated as expected: an overestimation 
of flux {6m < 0) leads to an overestimation of effective 
half-light radius ((Jlog^o -Re > 0). GALFIT performs weU 
overall; even in low-surface brightness bins, GALFIT still 
shows little bias compared with SExtractor. 

4.3.3. Comparison with SExtractor 

One can also calculate the transfer function kernels 
for SExtractor-measured values. The kernels derived for 
SExtractor are shown in the right half of Figure [SI One 
can see the biases of SExtractor-measured values: SEx- 
tractor tends to overestimate the effective half-light ra- 
dius in high-surface brightness bins (lower left) and un- 
derestimate the effective half-light radius in low-surface 
brightness bins (upper right). This is the main reason 
we decide to use GALFIT instead of SExtractor for size 
measurement. 

We also show the plots of the measured magnitudes 
and effective half-light radii for very narrow ranges of in- 
put magnitudes around m — 24, 25, and 26 mag in Figure 
[TUl One can also see here that SExtractor in general sys- 
tematically returns magnitudes that are fainter than the 
input values. This indicates that SExtractor systemati- 
cally misses a portion of the light from simulated sources. 




'^■h.O 24.0 25.0 26.0 27.0 ""A.O 24.0 25.0 26.0 27.0 
output magnitude output magnitude 

Fig. 10. — The measured magnitudes and sizes for the simulated 
sources within narrow ranges of input magnitudes around m = 
24, 25, and 26 mag derived from GOODS-depth j775-band images. 
We show measurement results for both GALFIT (left panel) and 
SExtractor (right panel). Results are shown for different input 
profile shapes: exponential profiles (n = 1) are shown in dots, 
while the de Vaucouleurs profile (n = 4) are shown in crosses. Both 
software show larger biases for the de Vaucouleurs profile, and the 
faint, extended wing of this profile is likely the cause. However, 
SExtractor shows larger biases for both profile shapes. 

Although GALFIT measurements show similar scatter, 
there is no obvious bias in Figure [TOl These comparisons 
demonstrate the advantages of profile fitting, at least in 
the case of smooth galaxies. In future efforts, we plan to 
quantify the biases for realistic — and possibly clumpy 
— galaxies drawn from hydrodynamical simulations. 

4.3.4. Applying T'^^ 
We apply the GALFIT transfer function kernels T^^^ 
to 6{m,Re) in equation [9] This is done by convolving 
9{m, Re) within the i'j'Ah bin of (m, Re) with the trans- 
fer function kernel pertinent to that bin, and summing 
the contribution from all bins together: 

ipi,j,{m',log^QRe) = {m, \og^Q Re) *T^j^{Sm,S log^Q Re) 

(13) 

^{m', logio -Re) = E ^''J' ("^'' logio K) (14) 

i'j' 

The distribution tp{m' ,\ogiQ R'^) is the expected dis- 
tribution in measured magnitude and effective half-light 
radius corresponding to the set of parameters P. There- 
fore ip{m' ,logio R') accounts for all the systematic ef- 
fects related to dropout-selection and GALFIT measure- 
ment. We define the final distribution on the grid of 
bin size {dm' ,d log R'^) = (0.02,0.02), and the detailed 
mathematical formulation of Eq. [T51 and [U on the 
grid is included in Appendix IA.21 After applying the 
dropout-selection function P{AI,Re,z) f Section I4.2.3P 
and GALFIT-transfer-function kernels T'^^, we arrive 
at the expected distribution of observed dropouts. The 
distribution corresponding to the fiducial set of parame- 
ters (a,M*,i?o,crinfl,,/3) = (-1.7,-21. 0,0.21", 0.7,0.3) 
is shown in the bottom panel of Figure [71 

4.4. Fitting Procedure 

We use the maximum-likelihood estimation (MLE) 

method to recover the best-fit parameters P = 
{a, M* , Ro,ainR^, l3). Our formulation is equivalent 
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to that proposed by 



iTrenti fc Stiaveiiil[20Ci8l: 



Sandage et al.l (|1979f ) (see also 
Su et al.ll201lD . In general, given 
a set of observed data points {xi,X2, Xn) and a model 
T{x) that describes the distribution of the observed 
points, the hkehhood of the model T{x) is 

n 

c = r(xi) • T{x2) ■ ■ ■ r{xn) = Y[r{x,) (15) 

i=l 

Here we try to determine the set of parameters 
P = (a, M*, i?o, CTinfl^, /3) that produces the model 
'ip{m' ,logiQ R'f.) that will maximize the likelihood func- 
tion £ for the observed magnitudes and sizes. Here we 
also need to include the component of the interlopers 
J(m', login de scribed in Section [53] (and calculated 
in Appendix IA.3P that accounts for the number counts 
of possible low-z interlopers due to photometric scatter. 
Therefore for each dropout sample we combine the log- 
likelihood from both fields (GOODS+HUDF) and find 
the set of parameters that maximizes the combined log- 
likelihood. In short, for a given set of parameters P 
we first calculate the overall normalization constant 0* 
through the equation 



^«(™;, \og,o{K)i) + /^(m^, logio(i?^)0 



logio(i?l)0 + /H(m;, logi„(i?l)0 



(16) 

where A^g and A'h are the total number of dropouts in 
GOODS and HUDF, respectively. The volume surveyed 
in each field has a lready been included in -0(m', logj^g R'^) 
(see Section 14.2.31 and Eq. |9]). We then calculate the sum 
of the log-likelihood of both fields: 

logio/:G(P)=A^Glogio 0*(P) 
Ng r 

i—1 '- 

+ /G(mUogio(i?e)^) 

logio£H(P) = A^Hlogio <^*(P) 



H 



4=1 

+ /H(TOUoglo(^e).) 



^'"(P; m'„logio(i?e) 



(17b) 



logio Aot(P) 



iogio/:G(P) + iogio/:H(P) (17c) 

We use the scipy package optimize to find the global 
maximum of logj^p'^tot- Within the package optimize, 
the fu nction f min_l_bf g:s_b uses the L-BFGS-B algo- 
rithm (|Zhu et al.l 119971 ) to search for global minimum. 
The best-fit parameters for each dropout sample are pre- 
sented in Section [SI 

5. BEST-FIT PARAMETERS AND CONFIDENCE 
INTERVALS 



The best-fit values of (a, M*, 0*, i?o, cinK, /?) for both 
^435-dropouts and Veoe-dropouts are listed in Table [3l 
We plot the model distributions corresponding to the 
best-fit values along with the GALFIT-measured mag- 
nitudes and effective half-light radii in Figures [TT] and 
[T^ Also plotted are the projections of the best-fit dis- 
tribution onto the magnitude axis (top panel) and the 
effective-radius axis (right panel) . Though we derive the 
best-fit parameters using the GOODS and HUDF fields 
jointly, we plot them separately for clarity. 

Although the maximum-likelihood estimator is a pow- 
erful tool to obtain the best parameters, it does not tell us 
how well the best-fit model matches the data. Therefore 
we adopt a Monte Carlo approach to assess the goodness 
of fit. We use the best-fit model to randomly draw a sim- 
ulated data set that contains exactly the same number 
of points as the observed data. We then calculate the 
likelihood of the simulated data set and compare it with 
the likelihood of the real data. We repeat the process 
5000 times and calculate the fraction of simulated data 
sets that have lower likelihood than the real data. For 
the i3435-dropout sample 89% of the simulated data sets 
have lower likelihood than the real data; for the Vqqq- 
dropout sample 42% have lower likelihood than the real 
data. We conclude that the best-fit models describe the 
observed distribution reasonably well. 

We show two size-luminosity relations in Figures [TT] 
and [12] The black dotted line minimizes the distance 
to all the observed points, so it traces the observed dis- 
tribution. The red dashed line corresponds to the best- 
fit size-luminosity relation with transfer functions. The 
slope of the black dotted line is steeper than that of the 
red dashed line because the latter tries to account for the 
surface-brightness-related selection effects. 

We plot the projected size distribution for both fields in 
Figure [TT] and [T2l in the right panels. The dashed curve 
is the size distribution predicted by the best-fit model 
parameters; the dotted curve is the distribution after ac- 
counting for the dropout-selection incompletenesses only, 
and the solid curve is the distribution after including 
GALFIT completeness, bias and scatter. The peak of the 
predicted intrinsic distribution (the blue dashed curve) 
is above the PSF size, implying that the decline of size 
distribution at sizes smaller than the observed peak re- 
flects the true peak of the underlying size distribution, 
and is not due to measurement incompletenesses alone. 

The predicted intrinsic luminosity function (as param- 
eterized as Schechter function) as well as the transformed 
luminosity functions are also plotted in the top panels in 
Figure[TlJ The shape of the predicted intrinsic Schechter 
function and the transformed luminosity function trace 
each other well, and serious incompleteness only sets in 
at the very faint end of our observed magnitude range. 

We use the Markov Chain Monte Carlo (MCMC) sam- 
pling to derive the uncertainties for each parameter and 
to explore the degeneracies among the different param- 
eters. The MCMC method is a powerful way to sam- 
ple complex, multi-dimensional probability distribution 
functions numerically. To do this, we use the public 
python package emce 43 (jForeman-Mackev et all 12012). 
In Figure [T3] and [TJ] we show the 68% and 95% confi- 
dence contours for all pairs of parameters. We plot the 

^ |http: //danfm. ca/ emcee] 
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TABLE 3 

The best-fit parameters of dropout samples. 



Sample 


a 


M* 


(lO-^Mpc-i) 


(arcsec) 


(kpc) 


""In Bo'^ 


/3 


i?435-dropouts 
Veoe-dropouts 


1 fiS+0.068 

_1 74+0.15 
^- '^-0.20 


-20.60+017 
-20.53t«;r7 


1 7Q+0.32 
'^-0.52 
, c:t;+0.62 
1.00_o 77 


^•^^-0.016 


1 04+0. 099 

-^■■'^-o.ios 

1 10+0.21 
^■^^-0.16 


r, oq+0.046 
'J-O'J_0.044 

'-'■^'-'-0.065 


r, 99+0.058 
^•^^-0.056 

25+°-l'5 



assuming 2; = 4 for i3435-dropouts and 2 = 5 for Veoe-dropouts. 
peak of the size distribution at Afisoo = —21 mag 
" o-infl^ = criogjQ X In(lO), where o-iog^o is in dex 



contours of both samples on the same scale so one can 
see immediately that the parameters for the Vgoe-dropout 
sample are less well constrained than the i?435-dropout 
sample. This is not surprising given the smaller sample 
size of the Vgoe-dropouts. Because the marginal distri- 
bution around each best-fit parameter is in general not 
Gaussian, we quote errors in Table |3] as the narrowest 
intervals around the best-fit values that bracket 68% of 
the probability. There is some weak correlation between 
the luminosity- function parameters (a, M*, and </>*) and 
the size-distribution parameters (logi?o, crin/?^, and 
Especially interesting is the weak correlation between a 
and /3 (the Pearson's correlation coefficient r = 0.27): 
shallower size- luminosity relation (smaller |/3|) leads to 
steeper faint-end slope a due to larger incompleteness 
corrections in the faint end. For example, if /3 = 0.1, 
the faint-end slope a become s ~ —1.9. A s imilar cor- 
relation was a l so dis cussed in IBlanton et al.l (|2001D and 
iGrazian et all (|2011[ ). The errors this degeneracy be- 
tween a and /3 introduces when one derives just lumi- 
nosity function alone likely requires careful simulations 
to quantify, but from the contours of Figure [l3] we esti- 
mate that the additional uncertainty in a is likely to be 
at most ~ ±0.05 for i3435-dropouts. 

In deriving the corrections for GALFIT measure- 
ment (the transfer-function kernels; see Section 14.31) 
we make the assumption that the input sources com- 
prise 70% exponential (late-type) profiles and 30% 
de Vaucoulerus (early-type) profiles. This is a rea- 
sonable approximation to what we see in Figure [Sj 
However we also tried fitting with kernels derived 
with 100% exponential profiles (disk-only kernels) and 
100% de Vaucouleurs profiles (spheroid-only kernels). 
The best-fit parameters derived using disk-only ker- 
nels are indistinguishable from our nominal results: 
the best-fit parameters for i3435-dropouts are P = 
(-1.68, -20.59, 0.19", 0.82, 0.22), while those for Vgoe- 
dropouts are P = (-1.75, -20.55, 0.19" , 0.88, 0.25). 
These are within the 68% contours of our nominal best- 
fit parameters. On the other hand, the parameters de- 
rived using the spheroid-only kernels are significantly 
different from our nominal results: the best-fit param- 
eters using the spheroid-only kernels for i3435-dropouts 
are P = (-1.55, -20.33, 0.32" , 0.97, 0.11), while those for 

yeoe-dropouts are P = (-1.43, -20.50, 1.30" , 1.50, 0.39). 
While we can not reject the hypothesis that all LBGs 
have n = 4 Sersic profiles based on the best-fit size- 
luminosity distributions, it is unlikely given the observed 
Sersic-index distribution in Figure [SJ We conclude that 
our size-luminosity distributions are reasonable as long 



as the majority of LBGs have close to exponential pro- 
files. 

6. DISCUSSION 
6.1. Comparison with Previous UV LF Determinations 

We compare our derived best-fit values of the Schechter 
function with those of other studies at similar redshifts 
in Figure [TSl Our derived a and cj)* are in general consis- 
tent within 1-(T of those from most studies; however, our 
M* is about 0.2 — 0.4 mag fainter than previous studies. 
The fainter M* is possibly driven by the low complete- 
ness near the faint end, where we potentially miss many 
galaxies due to low surface brightness. The inclusion of 
photometric interlopers is likely not the origin of the dis- 
crepancy: without photometric interlopers, the best-fit 
(a, Af*, (/)*) becomes (-1.64, -20.48, 2.21 x lO'^Mpc"^) 
at z - 4 and (-1.67, -20.51, 1.70 x 10"^ Mpc"^) at z - 
5, statistically indistinguishable from our nominal best- 
fit parameters. Extending our metho dology to include 
more field s (e.g. HUDF parallel fields; [Thompson et al.l 
2005; Oes ch et alll2007[ ) wiU better constrain the LF pa- 
rameters by increasing the sample size. 

However, we have reason to believe that the cos- 
mic variance introdu ces very small bia s es in our best- 
fit par ameter values. iTrenti fc Stiavelll (j2008[) (see also 
[Rober tson 201(|) has a comprehensive discussion about 
the effect of cosmic variance on the determination of lu- 
minosity functions at high redshift, especially in narrow, 
pencil-beam fields such as HUDF. In particular, in their 
Section 5 they examined how the number density of high- 
redshift dropouts will bias the best-fit Schechter function 
parameters. They demonstrate that within a single field, 
M* tends to be fainter when the field is under-dense; 
however, when one combines a shallower, larger field with 
a deeper, smaller field, and derives the LF parameters 
using the unb i nned maximum-likelihood method as in 
'Sandage e t al.l ()1979( ). biases in both a and M* can be 
re duced. Because our fit ting method is the one proposed 
bv iSandage et al.l ([1979') generalized to include multiple 
fields, we do not expect our best-fit a and M* to be 
strongly biased due to cosmic variance. 

6.2. The Size Evolution 

The peak of the size distribution at the same rest- 
frame 1500A absolute magnitude, M1500 = —21 mag, 
is 1.34+Q °Qg kpc for the i?435-dropout sample (assuming 
z = 4) and is 1.19+g'^g kpc for the Vgoe-dropout sample 
(assuming z = 5). If instead we calculate the peak of the 
size distribution at the best-fit M* of each redshift, we 
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Fig. 11. — The GALFIT-measured magnitudes and effective half-light radii of i?435-dropouts with best-fit model (with interloper con- 
tribution) in the GOODS (upper panel) and in the HUDF (lower panel). The set of best-fit parameters P is shown at the top. The 
i?435-dropouts used for fitting are shown as black points in the main panel. The contours correspond to the best-fit model accounted for 
both dropout-selection and GALFIT-measurement. The thin dotted line is the line that minimizes the sum of the distances to all the data 
points; the thick dashed line is the best-fit size-luminosity relation. The top inset of each panel shows the marginalized luminosity function 
from the best-fit model. The dashed line is the Schechter function corresponding to a boxcar selection function within 3.5 < z < 4.5; 
the dotted line is the function accounted for dropout-selection effects only; the solid line is the function accounted for both the dropout- 
selection and the GALFIT-measurement effects. The black points with Poisson error bars are the number of dropouts in each magnitude 
bin. The right inset of each panel shows the marginalized size distribution. The dashed line is the log-normal distribution; the dotted line 
is the size distribution accounted for dropout selection; the solid line is the distribution accounted for both dropout selection and GALFIT 
measurements. The black points with Poisson error bars are the observed numbers in each size bin. 
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P = [-1.74,-20.53,0. 19",0.90,0.25] 




Fig. 12. — The GALFIT-measured magnitudes and effective half-light radii of Veoe-dropouts in GOODS and in the HUDF with the 
best-fit model. The plot legend is the same as Figure [TT] 



will find that the size distribution peaks at 1.24 kpc at 
z = 4 (at M* = -20.60 mag), and at 1.07 kpc at z = 5 
{M* = —20.53 mag), showing slightly stronger evolution 
than at fixed luminosity. 

The convention al picture of iFall fc Efstathioul (19801) 
(see also Fall IQSl lDalcanton et al.lllQQTtlMo et aLlll998t 
iFalll 12^002.) states that the size of the gaseous disk Rd, 
formed in the center of dark-matter halo that acquired 
angular momentum through tidal interactions, is propor- 
tional to the virial radius i?vir of the host dark-matter 



halo and the spin parameter A of the halo: Rd oc Ai?vir- 
Assuming that the peak of the size distribution Ro{z) at 
a given luminosity Lq traces the peak of the log-normal 
A distribution, and the A distribution correlates weakly 
with redshift or halo mass (Barnes & Efstathiou 1987), 
one comes to the conclusion that Rq{z) oc i?vir(2^)- One 
can further relate the virial radius i?vir(-2^) of the halo 
to the circular velocity Vc and the mass Mh of the halo 
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Fig. 13.— The 68% (dark gray) and 95% (light gray) confidence 
intervals of S435 -dropouts for each pair of parameters. On the 
top of each column are the normalized probability density function 
(with arbitrary units on the y-axis) of each parameter labeled at 
the bottom of each column. The unit of logiJo is in pixels, with 
pixel scale 0.03". The correlations between the LF parameters (a, 
M* , (f>*) and the size-distribution parameters (log^QiJoi ""infl; /3) 
are relatively weak. 

through (|Ferguson et al.ll2004 ) 

so that Rd oc H^^{z) if i?vir at a fixed luminosity 
traces constant circular velocity, or Rd oc H~'^/'^{z) if 
i?vir traces constant halo mass, and H{z) is the Hub- 
ble parameter. In the concordance cosmology model, 
the Hubble parameter as a function of z is H{z) = 
Ho^/nm{l + zy + ^A, so at z > 2, H-^z) - {1+z)-^^^ 
and H~'^^^{z) ^ (1 + z)~^. Between z — A and 5, both 
relations predict only 20% and 31% changes in size, re- 
spectively. The peak size at M1500 = — 21 mag evolves 
only by 13% from our results, favoring only slightly the 
(1 + z)~^ relation. One needs data at lower redshifts 
to constrain the size-evolution scenario, where these two 
scaling relations diverge. 

Studies of size evolution of disk-like (or late-type) 
galaxies with similar luminosities have generally shown 
larger fractio nal changes in s ize at z < 3. At z 5. 1-1 , 
using GEMS (|Rix et al. 1'2004) data. ' Harden et al.l (f2005l ) 
found that the mean rest-frame V-band surface bright- 
ness of late- type galaxies with My ^ — 20 mag brightens 
by about 1 mag from z ~ to z ~ 1. This corresponds 
to a 40% decrea se in mean size of the galaxies with simi- 
lar luminos ities. 'Truiill o et al.l ()2006D . combining FIRES 
(jFranx et a l. 2000), GEMS, and SDSS (|York et al. 2000) 
data, found that at rest-frame y-band luminosity Ly > 
3.4 X 10i°/ifff Lq (or Mv < -21.53 mag) the mean effec- 
tive half-light radius of late-type galaxies is only about 
30% of that of t he loc al SDSS sample. At high redshift, 
[Ferguson et al.l (|2004[) found the mean half-light radius 



Fig. 14. — The 68% (dark gray) and 95% (light gray) confidence 
intervals of Vgoe-dropouts for each pair of parameters. On the 
top of each column are the normalized probability density function 
(with arbitrary units on the j/-axis) of each parameter labeled at the 
bottom of each column. The unit of log Rq is in pixels, with pixel 
scale 0.03". Similar to the case of B435-dropouts, the correlations 
between the LF parameters {a, M* , (f>*) and the size-distribution 
parameters (logj^QiJo, uinR, are relatively weak. Furthermore, 
the parameters for the Vgoe-dropouts are less well constrained than 
those for B435-dropouts, likely due to a smaller sample size, as the 
scales on the axes indicate. 

of galaxies selected in GOODS from z ~ 1.4 to z ~ 5 fol- 
lows the H~^{z) ^ (1 -I- z)~^^^ scaling relat ion. On the 
other hand, both lBouwens et al.l ()2004[ ) and lOesch et al.l 
(I20T0I) found the mean half-light radii of LBGs, selected 
in HUDF and HUDF ACS-parallel fields, to r oughly trace 
1 -|- z )~^ from z ~ 3 to z ~ 8, although lOesch et al.l 
2010f ) noted that the peak of the size distribution evolves 
little between z ~ 4 — 8. From the above observations 
of star-forming galaxies, the fractional change in size at 
similar luminosities is significant between z: = — 3, but 
smaller at z ^ 3. 

We plot the mean sizes of LHGs at z > 2 (in rest-frame 
UV) from a compilation of previous papers, as well as our 
best-fit Rq at M1500 ~ —21 mag, as a function of redshift 
in Figure 1161 Also included in the figure are the curves 
H{z)~^ and H{z)~^^^, both normalized to the derived 
Rq at z — 4 of this work. Due to different measurement 
methods, definitions of size, corrections for measurement 
biases and uncertainties, and corrections for incomplete- 
nesses, systematic differences in size at the same redshift 
exist. (The definition of effective half-light radius that 
we use is also meant to enclose half the total flux from 
a galaxy, but it does try to compensate for the missing 
flux at the edge of a galaxy, unlike the half-light radius 
measured by SExtractor as a dopted in all ot her papers 
included here.) For example. iFbrguson eiTall (,2004) de- 
rived the mean sizes of LHGs at z ~ 1 — 5 with rest-frame 
UV luminosity between 0.7L*^3 and 5L*^3, where L*^3 
is the charac t eristi c luminosity of LHGs at z = 3 from 
'Steid el et al.l (|1999[ ). They did not correct for sample in- 
completeness and measurement bias, and they reported 
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Fig. 15. — The best-fit values of a, M* , and 4>* along with 
the 1-iT and the 2-cr confidence contours, of the Schechtcr func- 
tion for UV LF at 2 ~ 4 {top panel) and 2^5 {bottom 
panel) fro m this study and from t he literature. The refer- 
ences are: 'Bouwens et al."2007" (B07); lYoshida et al.' '20061 (Y06); 
Kjckwith ct al. 2006 (Bc06); Sawicki & Thompson 2006 (ST06); 
^iavalisco 2005 (G05); Ouchi ct al.. .2004. (Ou04): Stcidel et al.l 
1999 (S99); Ocsch ct al. 2007 (OeOT): llwata et al.n20071 (107): 
van der Burg et al.. 2010 (vdBlO). 



the average size instead of the peak of the size distri- 
bution as in this paper (the mean of a log-normal dis- 
tribution is larger than the peak due to the tail at the 
high end). Comparisons between high-z LBGs and lo- 
cal galaxies are also hampered by different rest-frame 
wavelengths probed. In order to compare with the rest- 
frame optical sizes of local galaxies, one needs resolved 
images of most LBGs at z > 3 in the rest-frame opti- 
cal. However, currently such data are not available, and 
the James Webb Space Telescope will be necessary for 
this line of investigation. We also wish to extend our 
analyses to lower-redshift samples where high-resolution 
rest-frame UV data are available (down to z ~ 2; see e.g. 
iHathi et al. 2010) to obtain consistent size measurements 
across a longer redshift baseline. 



6.3. Width of the Size Distribution 

We find that the width of the size distribution, a\nR, 
is 0.83;°;°^^ at z - 4 and 0.90j:°:J^5 at z ~ 5. This 
seems to indicate that the width of the size distribution 
at fixed luminosity is slightly larger at z ~ 5, and both 
are larger than the width of the A distribution (ain x ^ 
0.5) of dark matter halos found i n the iV-body simu- 
lations (iBarnes fc EfstathioiJ [19871: IWarren et all Hool 
IBullock et al.ll2001l) . 

We plot the distribution of sizes of the sample — both 
measured by GALFIT and by SExtractor — in Figure 
1171 The observed widths of size distribution, without 
any statistical correction for biases and uncertainties, are 
cinR ^ 0.7 for the GALFIT-measured effective half-light 
radii. In comparison, the SExtractor-measured half-light 
radii shows narrower distributions: the widths of the 
best-fit log-normal distributions are crinR ^ 0.4. Al- 
though the widths of the SExtractor-measured size dis- 
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Fig. 16. — The size evolution of LBGs at z > 2 from this work 
and previous s t udies. Included in this figure are results from 



IFerguson eFaD |(200D (F04; their mean size for LBGs between 
0.7Z/*_3 and 5L*_3, where L*_, is the characteristic luminosity of 
a 2 = 3 LBG from fSteidel et al.lfl9 99l. Bouwens ct al. ( 2004) (B04; 
the mean size for LBGs between 0.3L*_3 and L* j). iHathi et aLl 
II2008I) (H08; the mean size of LBGs with spectroscopic redshifts), 
and " lOesch et al] |[20Tol ) (Oel O; the mean size of LBGs between 
0.3L*_3 and L*_^, similar to lBouwens et al . 2004). In this paper 
we adopt the effective half-light radius Re that encloses half the 
total flux as galaxy size, therefore we try to compensate for the 
missing flux at the edge of a galaxy, while all other studies quoted 
here adopt half-light radius {Rh) measured by SExtractor. We ex- 
plored the bias of effective half-light radius and half-light radius in 
Section [4.3l The size evolution relations H{z)~^ and H{z)~^/^ are 
plotted as the solid and dashed line, respectively, with the curves 
normalized to the 2 = 4 size from this work. All sizes are converted 
to the cosmological parameters adopted in this study at the quoted 
mean redshift of each sample. Mean sizes at 2 < 2, at the same 
rest-frame wavelength as probed at higher redshifts, are required 
to distinguish between different evolutionary scenarios. 



tribution are more in line with the theoretical expecta- 
tions ((Tin A ^ 0.5), we believe that it is biased because 
(1) SExtractor is severely limited by the PSF size, so it is 
strongly biased (i.e. the measured size is bigger than the 
true size) when the galaxies are barely resolved, and (2) 
for large galaxies SExtractor is also strongly biased due 
to the tendency to miss the tail of the light distribution. 
We therefore believe that the intrinsic width of the size 
distribution is significantly wider than what SExtractor- 
measured distribution indicates, and our statistical treat- 
ment indeed shows that the intrinsic size distribution is 
wider than the that measured by SExtractor. 

One must also consider the fact that GALFIT (or 
any size-measurement method) can artificially broaden 
the observed size distributions due to measurement un- 
certainties. However our results show that although 
GALFIT might artificially broaden the intrinsic size dis- 
tributions, the size distributions after accounting for 
sample-selection incompleteness are even wider than the 
GALFIT-measured distributions. Therefore sample in- 
completenesses, especially at the small- and large-size 
ends, are more important in shaping the observed widths 
of the size distributions than whichever measurement 
method one uses. 

The widths of the size distribution of local galaxies 
are in general smaller than what are found here for 
LBGs. When considering galaxies divided b y their mor- 
pholog ical types drawn from SDSS (z ~ O.D. lShen et all 
(|2003f ) found that crinR ^ 0.3 at the bright-end, while 
the width increases to trinR ~ 0.5 at the faint-end. 
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Fig. 17. — The distribution of SExtractor-measured half-light 
radii (-R50) and GALFIT- measured effective half-Hght radii (Re) 
together with their best-fit log-normal distributions without ac- 
counting for incompletenesses or biases. Light-gray histograms and 
dashed curves are those for the SExtractor-measured i?5o; dark- 
gray histograms and dashed curves are those for the GALFIT- 
measured Rg. The GALFIT- measured R^ only include the ones 
with acceptable fits. The black solid curve is the volume- weighted 
sum of two log-normal distributions in GOODS and HUDF that 
correspond to the best-fit parameters listed in Table |3] It repre- 
sents our best estimate of the underlying size distribution of the 
two fields. The histograms and curves are normalized to give a to- 
tal probability of 1, so incompleteness is not included. The widths 
of the lognormal distributions ai^ ^ are indicated in the legends. 

This is true when considering either late-type galaxies 
alone or early-ty p e gal axies alone. Furthermore, both 
Ide Jong fc LacevI (|2000[ ) and iCourteau et"an ()2007[ ) de- 
rived that (TinR ~ 0.3 for their local spiral galaxies. AU 
of the above results show that the size distribution for 
local disk galaxies is narrower than the A distribution of 
DM halos, and mechanisms to systematically remove disk 
galaxies residing in small- A and large- A halos such as bar 
instability and supernova feedback are suggested. Given 
the wide size distribution derived for LBGs in this work, 
it is possible that LBGs sample the whole spectrum of 
A, and some aspects of baryon physics further broaden 
the size distribution. In addition, there are also hints of 
wider size distributions than that of late-types or early- 
types alone when we consider both types together (e.g. 
[F^ligsafRomanowskv fc Fallll2012j ). Because it is still 
unknown whether the majority of LBGs are rotation- 
supported disks, the relation between the width of the 
size distributions of LBGs and that of A is still not clear. 

6.4. The Size- Luminosity Relation 

Adopting the power-law form of the size-luminosity re- 
lation i? cx L^, we find that l3 = 0.22j:E; °^^ for the B. 



dropouts and (3 = 0.25 



-f0.15 
-0.14 



for the Vf 



606- 



435- 

dropouts. The 



trend between the peak of the size distribution R{L) and 
luminosity L, without accounting for incompletenesses at 
the low-surface brightness end, is steeper due to selection 
biases against low-surface brightness galaxies. The ob- 
served /3, without accounting for incompleteness, is 0.32 
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Fig. 18. — The best-fit rest-frame 1500 A size-luminosity (RL) 
relations of z ~ 4 and 2: ~ 5 LBGs (black and blue lines, re- 
spectively), and the hatched area shows the uncertainties of the 
normalization Rq . Also included are the best-fit size-luminosity re- 
lations for local galaxies from the literature. The slopes of the size- 
luminosity relations are listed in Table [4] Note that the local size- 
luminosity relations are in rest-frame optical wavelengths instead 
of the UV wavelengths probed for high-2 LBGs. The green and red 
dash ed lines are the SDSS r-band late-type and early-type galaxies 
from lShen eTall l(2003i 1 (SOS) with Mr between -24 and -16 mag 
for early-types, and —24 and —19 mag for late- types. The magenta 
dot-dashed line is from ICourteau et all H2007I ) (C07); the /-band 
luminosity Lj of their local disk galaxies range from 10*-5 to 10"-^ 
L/.Q . The orange solid and dashed lines are for the disk-|-bulge and 
disk-only components of the galaxies from lde Jong fc L accv (200(1) 
(dJLOO); the /-band absolute magnitude Mj of their disk galaxy 
range from —25.5 to —16.5 mag. The slopes of LBGs and local disk 
galaxies are similar, but the offset in normalization reveals the size 
evolution between z ~ 4 — 5 and z ~ 0. The error bars on the 
top-right corner show the width of the size distribution for LBGs. 
The projected size-luminosity relation from the best-fit relation at 
z ~ 4 are plotted with a gray solid line (following H{z)~^, which is 
~ (1 + z)^'^ at 2 > 2) and a gray dashed line (following H{z)~^^^ , 
which is ~ (1 -(- z)~^ at 2 > 2). It is interesting that the prediction 
from H{z)~^^^ matches the Ide Jong fc LacevI 1 12000 ) results very 
well. 



for the i3435-dropouts and 0.30 for the Veoe-dropouts. 
We also plot in Figure [12] the peak of the log- normal size 
distribution R{L) and the width cJinR in different appar- 
ent magnitude bins, when a, M*, and /3 are held fixed 
to the best-fit values. Different scaling relations between 
the mean size R and luminosity L are also plotted. Af- 
ter accounting for incompletenesses, the size-luminosity 
relation is still inconsistent with being flat. 

The observed size-luminosity (RL) relation of low-2; 
disk galaxies in the optical bands has been shown to have 
similar power-law index /3 as our result for the high-z 
galaxies. We compile the size-luminosi ty relations for lo- 
cal galaxies in Table S) For example, Ide Jong fc LacevI 
llOOO!) observed that Rd (x L°-^^ for their local disk 
galaxy sample with types later than Sb. iShen et al.l 
( 2003) observed that Re (x L°-^^ in the faint-end of their 
late-type galaxy sample (defined as n < 2.5) drawn from 
SDSS, where Re is their best-fit Sersic effective half-fight 
radius, although they also observed that at the bright 
end of their late-type galaxies the observed slope is sig- 
nificantly steeper (/3 ~ 0.5). [Courtcau et al. (2007) ob- 
served that Rd oc L°-'^^ for their local disk galaxy sam- 
ple, including both field and cluster galaxies, and also 
observed a type-dependent trend in /3 such that later- 
type disk galaxies have shallower slope (/3 = 0.25 for 
their Sd-type sample). On the other hand, the observed 
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RL relation for focal early-type galax ies generally hav e 
steeper slopes: /? ~ 0.5—0.6 in general (jShen et al.ll2003[ ). 
We plot the RL relations from local galaxies along with 
the derived RL relations for out LBGs in Figure [TH 
The agreement between the slopes of the local RL re- 
lations and those for the LBGs in this study is interest- 
ing. Although most galaxies in our LEG sample have 
n < 2.5, which will be classified as late-type galaxies 
based on Sersic index only, whether they are disk galax- 
ies is still uncertain. For example, LBGs have clliptic- 
ity distributions in consistent with being drawn from an 
optic ally-thin disk (jRavindranath et al.ll2606l : iLaw et al.l 
120121) . On the other hand, the normalization of the RL 
relation between different redshifts reveals the amount of 
size evolution between these two epochs. The similarities 
between the size-luminosity relation of the local latc-type 
galaxies and high-z LBGs suggests that their formation 
histories share some similarities, or even that LBGs are 
the progenitors of local disk galaxies. Furthermore, if 
one scales the best-fit RL relation at z ~ 4 according 
to H{z)~^/^, the resultant RL relation matches the local 
relat ions very well ( th e dark -gray solid line i n Fig ur ellSI) . 

' ' mm 



ide Jong fc Laceyi dgOOl (see also iFal m 
IDalcanton et al.l 119971: IMo et al.l I1998D offered 
derivation of the expected value of /? for disk galaxies. 
We saw in Section 16.2! that in the standard picture of 
disk formation, i?d oc Ai?vir, and the virial radius i?vir is 
proportional to the circular velocity Vc at a given redshift 
(Eg [T8t . The Tully-Fisher relation ()Tullv fc Fished 
I1977D of local disk galaxies in the i-band shows that 
Vc (X Ll^, where ^ - 0.29 ()Courteau et al.ll2007l) . This 
translates to Rd oc XVc oc \L^ . The best- fit (3 of our 
sample, 0.22 for the i?435-dropouts and 0.25 for the 
Vfjoe-dropouts, are slightly smaller than the prediction 
by the simple argument, but are still consistent with the 
TF slope of local disk galaxies. 

7. SUMMARY 

We study the bivariate size-luminosity distributions of 
Lyman Break galaxies (LBGs) at redshifts around 4 and 
5. We select LBGs from the GOODS and the HUDF 
fields to constrain the size-luminosity distribution. We 
break down the possible systematic errors into two com- 
ponents — one from dropout selection, the other from 
size and magnitude measurement by GALFIT. We then 
perform comprehensive simulations to quantify these er- 
rors, and we develop numerical tools to apply the cor- 
rections statistically to our model — both in magnitude 
and size bins. The bivariate size-luminosity distribution 
model is a combination of Schechter function and a log- 
normal distribution, and our best-fit parameters, after 
all the corrections, are summarized in Table [3l 

The best-fit parameters for the luminosity function 
show a slightly shallower faint-end slope a, slightly 
brighter characteristic absolute magnitude M*, and 
higher normalization constant 0* of the luminosity func- 
tion at z ~ 4 than at z ~ 5. However the parameters are 
consistent with no evolution within 1-a. The parameters 
for size distribution also show mild evolution between 
z ~ 4 and 5, with _B435-dropouts having larger peak size 
i?o at A/1500 = —21 mag, narrower width crini?^ of size 
distribution, and shallower size-luminosity relation slope 
/3. The faint-end slope a and the slope of size-luminosity 




25 26 27 

apparent magnitude 

Fig. 19. — The mean of the log- normal size distribution in ap- 
parent magnitude bins (24, 25), (25, 26), (26, 27), and (27, 28.5) 
for both iJ435-dropouts and Vsos-dropouts. The apparent magni- 
tude is in the iyrs-band for the 6435 -dropouts and in the 2:85o-band 
for the Veoe- dropouts. The error bars in the y-direction are the 
±1(T width of the log-normal distribution; the error bars in the x- 
direction are the bin width in magnitude. The size distributions in 
each magnitude bin are fitted when a, M*, and /3 are held fixed to 
the best-fit values in Table |3] Size-luminosity relations of the form 
R oc with different 13 are also plotted. 



relation /3 show a mild correlation, suggesting the im- 
portance of size-luminosity relation in the completeness 
correction of the faint end of luminosity function. 

A significant result from our derived size distribution 
is that the widths of the size distributions for LBGs at 
z ^ 4 and 5 are larger than the width of the spin param- 
eter distribution observed in cosmological A'^-body simu- 
lations, as opposed to local disk galaxies which have nar- 
rower size distributions. We argue that the narrow size 
distributions observed could be due to the bias inherent 
in the size-measurement method. This also implies that 
the LBGs sample the full range of the A distribution. 
The slopes of the size-luminosity relations /3 are consis- 
tent with those for local late-type galaxies, but are shal- 
lower than local early-type galaxies. Given the uncertain 
disk nature of LBGs at these redshifts, the detailed con- 
nection between size and A is not clear. 

We plan to extend our study to includ e other deep ACS 
fields (e.g. HUDF05. IO"esch et al.ll2007[) in future work to 
increase the sample size at the redshifts probed in this 
work. Furthermore, we will also extend our investiga- 
tions to lower redshifts to lengthen our redshift baseline. 
Size distributions from z < 3 are also crucial in determin- 
ing the functional form of size evolution as a function of 
redshift. However we are limited to the rest-frame UV 
for measuring the size distributions of LBGs at z > 5; 
high-resolution, rest-frame optical data for LBGs are not 
available until JWST comes online. With JWST, it wiU 
be possible to compare rest-frame optical sizes of local 
and high-redshift galaxies and eliminate the uncertain- 
ties introduced by morphological if-corrections. 

We are grateful to the referee for the comments that 
lead to significant improvements of this paper. We thank 
Michael Fall, Steve Finkelstein, Nimish Hathi, Mauro Gi- 
avalisco, Russell Ryan, Massimo Stiavelli, and Sara Sal- 
imbeni for useful discussions and help with the simula- 
tions. We also thank Chien Y. Peng for producing GAL- 
FIT. This work is supported by program number HST 
60-12060, which was provided by NASA through a grant 
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TABLE 4 

The Slope of the Size-Luminosity Relations and the Width of the 
Log-normal Size Distribution of Local Galaxies and LBGs at 2 ~ 4 

AND 5. 



Sample 



Reference 



Slope (/3)^ Width (o-infl) 



2 ~ 4 LBG 
2 ~ 5 LBG 

- late-type'^ 

- late-type'^ 

' early-type" 
2 ~ disk 
' disk-(-bulge 
~ disk only 



This work 
This work 
Shen et al. 2003' ^ 
Shen et al. 2003 
Shen eTaT 2003 
Courteau et al. 2007^ 
de Jong fc Laccv 2000^ 
de Jong &: Lacev 2000 



0.22 



+0.058 
-0.056 



0.25 



-t-0.15 
0.14 

0.261^ 

0.51 

0.65 
0.321 ± 0.010 
0.253 ± 0.020 
0.214 ±0.025 



0.83 



-1-0.046 



0.90 



0.044 
-HO. 15 
0.065 
0.45 

0.27 
0.45/0.27 

0.325 
0.28 ±0.02 
0.36 ±0.03 



Assuming B, oc L ^ . 
^ Galaxies with n < 2.5 and r-band absolute magnitude Mr > —20.91 mag. 

The quoted size is the r-band Sersic effective half-light radius. 

Galaxies with n < 2.5 and r-band absolute magnitude M^- < —20.91 mag. 

Galaxies with n > 2.5; the widths are the same as late- type galaxies in both 
the faint end and the bright end. 

' The quoted size is the /-band exponential scale length corrected for inclina- 
tion. 

^ The quoted size is the /-band half-light radius of an exponential light profile 
extrapolated to infinity. 

^ IShen et al.l (^003) did not give formal unc ertainties of the deriv ed slopes and 
scatter of their size-luminosity relations; iCourteau et al.l l|2007l ) did not give 
their formal uncertainty of the derived scatter. 



from the Space Telescope Science Institute, which is op- erated by the Association of Universities for Research in 

Astronomy, Inc., under NASA contract NAS5-26555. 
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APPENDIX 

CORRECTING FOR DROPOUT SELECTION AND GALFIT MEASUREMENT 

Applying Dropout- Selection Function P{M, R, z) 

Here we elaborate on the details of applying the dropout-selection function P{M, Re, z) on the size-luminosity 
distribution (Eq. [71) on a grid of (M, log^g -Re)- We also explain how we apply the dropout-selection function in the 
two fields separately, and how we combine the two fields to calculate the total likelihood. 

For clarity first we define several variables: 

• M: the intrinsic 1500 A absolute magnitude 

• TO and R: the intrinsic apparent magnitude and effective half-light radius (we will omit the subscript e in i?e 
for clarity of notations) 

Also the superscripts G stands for GOODS and H stands for HUDF in the equations that follow. 

In the GOODS-depth images, the kernel can be expressed as the selection function P^(M, logj^Q R, z), defined as the 
fraction of sources with input M, logj^g R, and z that is detected and selected as a dropout by the color criteria. In this 
study we calculate the selection function P'^{AI, log^Q R, z) in bins of (M, logj^g R, z), such that P^j. is the completeness 
with absolute magnitude in the range {Mi, Mi + AM), size in the range (logj^p logio + ^ logio ^)' ^^'^ redshift 
in the range {zk, Zk + Az). The bin size (AM, Alog^o R, Az) is selected to be (0.5, 0.2, 0.1), and we implicitly assume 
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that the selection function stays the same within the bin. Usuahy the dropout selection function _f^^ is only greater 

than zero within Az ^ 0.5 around the central redshift (z — A for _B435-dropouts and z = 5 for Vgoe-dropouts). 

The first step towards building the expected bivariate distribution of observed LBGs is to apply the dropout selection 
function P^j^ to the theoretical distribution (Eq. [7]). We define 5*, sans the normalization factor (/>*, on a grid of 

(Mp, logiQ ^q) where p and q are the indices of the "pixels". We choose the pixel size to be {dM , dlogig R) = (0.02,0.02). 
We apply the dropout-selection function in the following way: for each kernel within the range Mi < M < Mi + AM 
and logiQ Rj < log^g R < logio + ^ logio calculate the contribution to the number density of LBGs predicted 

by ^I' in this bin, weighted by the volume in each redshift interval and the dropout selection function P^^- In GOODS, 

ef^imr, logio R,) = E logio P^% ^ ^(^'Ip - DM(zfc) ^K^mr) (Al) 

p,k 

^^,,®P,% (A2) 



where 

( ^'(Mp, logio Rq) if < ^^p < + AM and 
{Mp, logio Rq) = < logio < logio Rq < logio Rj + A logio R (A3) 

[ otherwise 

and dV/dzk is the volume element within the surveyed area and the redshift range {zk,Zk + Az). The Dirac delta 
function S{Mp — DM(zfe) — K — rrir) facilitates the conversion from absolute magnitude M to apparent magnitude m 
so that the result is a distribution defined on the grid of {m,\ogiQ R) with the same pixel size {dm,d\ogiQ R) = 
(0.02, 0.02). In the argument of the delta function, DM(zfc) is the distance modulus to redshift Zk (calculated using the 
assumed cosmological parameters), and K is the fc-correction term that converts from rest-frame 1500 A magnitude to 
the apparent magnitude in the chosen band. Since we choose to use the bandpasses that are closest to rest-frame 1500 
A, the fc-correction term is in general very small. Thus Eq. lAll is the contribution to the distribution of (m, logiQ R) 
from the model within Mi < M < Mi + AM and log^o Rj < logio R < logio Rj + ^ logio R- Lastly, Eq. \M\ is a 
shorthand notation for the application of the dropout-selection function to the model ^. 

The resultant distribution of number density, combining \I' in all absolute-magnitude and effective-radius bins and 
after applying the dropout selection function, is 

9^{mr, logio -^s) = II ^^'j^™-' logio ^q) (A4) 
Similarly, we construct the distribution after applying the dropout-selection in the HUDF using the same procedure: 
ef^ {mr, logio Rq) = E ^viMp, logio Rq) P^k ^ S{Mp " DM(zfc) -K- m,) (A5) 



p,k 

'ij ^ ^ ijk 

0"(m,, logio Rq) - E ^^Mr' logio Rq) (A7) 



(A6) 



Applying GALFIT Transfer Functions 

The second set of transfer-function (TF) kernels quantifies the GALFIT-measurement bias, scatter, and completeness 
of GALFIT giving reasonably good fits. It relates the intrinsic apparent magnitude and effective half-light radius to 
the observed apparent magnitude and effective half-light radius. Therefore we define 

• to', R': the measured apparent magnitude and effective half-light radius 

• Sm — TOiYLcasurcd ^intrinsic ^ ^ 

• (5 logio R = logio -^measured - logio -^intrinsic = logio R' ~ logio R 

Again the superscripts G stands for GOODS and H stands for HUDF in the equations that follow. 

Similar to the case of dropout selection TF kernels, we use a bin size of 0.5 in to and 0.2 in logio R and calculate 
GALFIT TF kernels in each intrinsic magnitude and size bin, assuming that the kernel property is uniform within 
the bin. The GALFIT TF kernels in each bin are probability density functions (PDF) defined on a pixel grid of 
((5to, (5 logio i?) within pixel width (^to, d logio i?) = (0.02,0.02). The value of the GALFIT TF kernel at (Jto, (5 logio i?) 
is the probability that the measured magnitude of the object is off by an amount of 6m, and the size is off by an amount 
d logio R to measurement. In the i'-th apparent magnitude bin and the j'-th size bin, in which to^' < to < to^' -I- Ato 
and logio Rj' < logio R < logio + A logio R^ the GALFIT TF kernel in GOODS is denoted as T^^'^{Sm,Slogj^o R). 
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Since each kernel T^^''~^ {5m, 6 \ogiQ R) only applies in its own bin, we calculate the contribution to the expected 



distribution of observed magnitude and size in each bin as 

(TOs' logio R'l) = (™'-' logio Rq) - mr, log^o R'l - logio Rq) 



q,r 

- aG 



/ / ^ 1 -I 



where 

^ otherwise 
The distribution of (m',log]^Q R'), after applying the GALFIT transfer functions in all bins, is the sum of "0,^,/: 



^'^{nir, logj^o Rq) if "^i' — "^i' < "^i' + ^'^d 

logio Rj' < logio -Rq < logio Rj' + A login R 



(A8a) 
(A8b) 

(A9) 



^^K, logio ^0 - E '/•'^J' K' logio ^0 



as 



(AlO) 

Similarly in the HUDF, for the same dropout sample, one can calculate the expected distribution of (m', log^Q i?') 

, (m;, logio = E ' ("^^ logio Rq) T;°/'"K - "^r, logio R'l - logio Rq) (Alia) 

(Allb) 
(A12) 



q,r 



^'"(m^, logio R'l) = E '/^"j ' logio ^0 



Eqs lASb land lAlfblsay t hat the a pplication of TF kernels is equivalent to convolution of 6 and T"-^^ on a discrete grid 
in each bin. And Eqs I AlO l and I Al 21 sav that the final distribution is the sum of the contributions from all (m', logio -^0 
bins. 

Estimated Interloper Contribution 

We estimate the contribution from interlopers due to photometric scatter by combining the estimated interloper 
fraction as a function of apparent magnitude and size in Figure [2] We do not know exactly the size distribution of 
the interlopers, so we use the size distribution of all sources in the field as an approximation. In the GOODS fields, if 
the size distribution of all non-dropout sources is S'^{R'i), and the interloper number density as a function of apparent 
magnitude is I'^'-"^^^(m'j,), then the interloper contribution is 

/«K,i?;)=iGK)5G(i?;) 

Similarly, in HUDF 

/«K,i?j)=i"K,)5«(i?;) 



(A13) 
(A14) 



The Resultant Bivariate Size-Luminosity Distribution 

After accounting for dropout-selection and GALFIT-measurement, the expected distributions on a discrete pixel 
grid in the GOODS and HUDF are constructed as 



V'^K, logio i?0 = E 



V'^K, logio i?;) = E 



ijk 



ijk 



T,GF,G 



^GF.H 



(A15) 



We add the interloper contributions fSection lA.3l) /"^ and to ■0'^ and ■0^ to compare with the measured magnitudes 
and sizes of our LEG sample and derive the best-fit parameters. 



